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^ (54) Title: SUBUNIT VACCINES WITH A2 SUPERMOTIFS 

m 

^ (57) Abstract: Methods to design vaccines which are effective in individuals bearing A2 supertype alleles are described. Single 
amino acid substitution analogs of known A2-supeitype binding peptides, and large peptide libraries were utilized to rigorously 
define the peptide binding specificities of A2-supertype molecules. While each molecule was noted to have unique preferences, 
large overlaps in specificity were found. The presence of the hydrophobic and aliphatic residues L, I, V, M, A, T, and Q in posiLon 2 
of peptide ligands was commonly tolerated by A2-supert)'pe molecules. L, I, V, M, A, and T were tolerated at the C-terminus. While 
examination of secondary influences on peptide binding revealed allele specific preferences, shared features could also be identified, 
and were utilized to define an A2-supermotif. Shared features also correlate with cross-reactivity; over 70% of the peptides that 
bound A*020l with high aflBnity were found to bind at least 2 other A2-supertype molecules. Finally, the coefficients for use in the 
development of algorithnis for the prediction of peptide binding to A2-supertype molecules are provided. 
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SUBUNIT VACCINES WITH A2 SUPERMOTIFS 



Technical Field 

Subject matter disclosed herein relates to the design of vaccines which will be effective in 
large portions of the population, in particular, those members of the population who are 
5 characterized as having an A2 supertype allele. Subunit vaccines which comprise the A2 
supermotif can be designed to effect such population coverage. 

Background 

The genetic makeup of a given mammal encodes the structures associated with the 
immune system of that species. Althougji there is a great deal of genetic diversity in the human 

10 population, even more so comparing humans and other species, there are also common features 
and efifects. In mammals, certain molecules associated with immune function are termed the 
major histocompatibility complex. 

MHC molecules are classified as either Class I or Class 11 molecules. Class II MHC 
molecules are expressed primarily on cells involved in initiating and sustaining immune 

15 responses, such as T lymphocytes, B lymphocytes, macrophages, etc. Class 11 MHC molecules 
are recognized by helper T lymphocytes and induce proliferation of helper T lymphocytes and 
amplification of the immune response to the particular immunogenic peptide that is displayed. 
Class I MHC molecules are expressed on almost all nucleated cells and are recognized by 
cytotoxic T lymphocytes (CTLs), which then destroy the antigen-bearing cells. CTLs are 

20 particularly important in tumor rejection and in fighting viral infections. 

CTLs recognize the antigen in the form of a peptide fi-agment bound to the MHC class I 
molecules rather than the intact foreign antigen itself. The antigen must normally be 
endogenously synthesized by the cell, and a portion of the protein antigen is degraded into small 
peptide fragments in the cytoplasm. Some of these small peptides translocate into a pre-Golgi 

25 compartment and interact with class I heavy chains to facilitate proper folding and association 
with the subunit ffl microglobulin. The peptide-MHC class I complex is then routed to the cell 
surface for expression and potential recognition by specific CTLs. 

Investigations of the crystal structure of the human MHC class I molecule, HLA-A2.1, 
indicate that a peptide binding groove is created by the folding of the al and oQ, domains of the 

30 class I heavy chain (Bjorkman, ei aL, Nature 329:506 ( 1 987)). In these investigations, however, 
the identity of peptides bound to the groove was not determined. 
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Buus, et al, Science 242:1065 (1988) first described a method for acid elution of bound 
peptides from MHC. Subsequently, Rammensee and his coworkers (Falk, et ai.. Nature 351 :290 
(1991)) have developed an approach to characterize naturally processed peptides bound to class I 
molecules. Other investigators have successfully achieved direct amino acid sequencing of the 
5 more abundant peptides in various HPLC fractions by conventional automated sequencing of 
peptides eluted from class I molecules of the B type (Jardetzky, et al,. Nature 353:326 (1991)) 
and of the A2.1 type by mass spectrometry (Hunt, et al. Science 225: 1261 (1992)). A review of 
the characterization of naturally processed peptides in MHC Class I has been presented by 
Rotzschke & Falk (Rotzschke & Falk, Immunol Today 12:447 (1991)). PCX publication 
1 0 WO 97/3462 1 , incoiporated herein by reference, describes peptides which have a binding motif 
for A2.1 alleles. 

Sette, et aL, Proc. Nat'L Acad. Sci. USA 86:3296 (1989) showed that MHC allele 
specific motifs could be used to predict MHC binding capacity. Schaeffer, et al, Proc. Nat 1 
Acad, Sci, USA 86:4649 (1989) showed that MHC binding was related to immunogenicity. 

15 Others (De Bruijn, et al, Eur. J. Immunol, 21:2963-2970 (1991); Pamer, et a/., 991 Nature 
353:852-955 (1991)) have provided preliminary evidence that class I binding motifs can be 
applied to the ^^i^^ immunogenic peptides in animal rnodels. Class I motifs 

specific for a number of human alleles of a given class I isotype have yet to be described. It is 
desirable that the combined frequencies of these different alleles should be high enough to cover 

20 a large fraction or perhaps the majority of the human outbreed population- 

Despite the developments in the art, the prior art has yet to provide a usefril human 
peptide-based vaccine or therapeutic agent based on this work. 

Summary 

The invention provides the parameters for the design of vaccines which are expected to 
25 effectively target large portions of the population. Following the guidance set forth herein, to 
prepare vaccines with respect to a particular infectious organism or virus or tumor, the relevant 
antigen is assessed to determine the location of epitopes which are most likely to effect a 
cytotoxic T response to an infection or tumor. By analyzing the amino acid sequence of the 
antigen according to the methods set forth herein, an appropriate set of epitopes can be 
30 identified. Peptides which consist of these epitopes can readily be tested for their ability to bind 
one or more HLA alleles characteristic of the A2 supertype. In general, peptides which bind 
with an affinity represented by an IC50 of 500 nM or less have a high probability of eliciting a 
cytotoxic T lymphocyte (CTL) response. The ability of these peptides to do so can also readily 
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be verified. Vaccines can then be designed based on the immunogenic peptides thus identified. 
The vaccines themselves can consist of the peptides per se, precursors which will be expected to 
generate the peptides in vivo, or nucleic acids encoding these peptides for production in vivo. 

Thus, in one aspect, the invention is directed to a method for identifying an epitope in an 
5 antigen characteristic of a pathogen or tumor. The epitope identified by this method is more 
likely to enhance an immune response in an individual bearing an allele of the A2 supertype than 
an arbitrarily chosen peptide. The method comprises analyzing the amino acid sequence of the 
antigen for segments of 8-1 1 amino acids, where the amino acid at position 2 is a small or . 
aliphatic hydrophobic residue (L, I, V, M, A, T or Q) and the amino acid at the C-terminus of the 

10 segment is also a small or aliphatic hydrophobic residue (L, I, V, M, A or T). In preferred 

embodiments, the residue at position 2 is L or Mv In other preferred embodiments, the segment 
contains 9-10 amino acids. In another preferred embodiment, the segment contains Q or N at 
position 1 and/or R, H or K at position 8, and lacks a D, E and G at position 3 when the segment 
is a lO-mer. Also preferred is V at position 2 and at the C-terminus, 

1 5 Also described herein are compositions comprising inmiunogenic peptides having 

binding motif subsequences for HLA-A2.1 molecules. The immimogenic epitopes in the 
peptides, which bind to the appropriate MHC allele, are preferably 8-11 residues in length and 
more preferably 9 to 10 residues in length and comprise conserved residues at certain positions 
such as positions 2 and the C-terminus. Moreover, the peptides do not comprise negative 

20 binding residues as defined herein at other positions such as positions 1, 3, 6 and/or 7 in the case 
of peptides 9 amino acids in length and positions 1, 3, 4, 5, 7, 8 and/or 9 in the case of peptides 
10 amino acids in length. The present invention defines positions within a motif enabling the 
selection of peptides which will bind efficiently to HLA A2. 1 . 

Epitopes on a number of immunogenic target proteins can be identified using the 

25 sequence motifs described herein. Examples of suitable antigens include prostate cancer specific 
antigen (PSA), hepatitis B core and surface antigens (HBVc, HBVs) hepatitis C antigens, 
Epstein-Barr virus antigens, human immunodeficiency type-1 virus (HIVl), Kaposi's sarcoma 
herpes virus (KSHV), human papilloma virus (HPV) antigens, Lassa virus, mycobacterium 
tuberculosis (MT), p53, CEA, trypanosome surface antigen (TSA) and Her2/neu. The peptides 

30 and nucleic acids encoding them are usefiil in pharmaceutical compositions for both in vivo and 
ex vivo therapeutic and diagnostic applications. 
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Definitions 

The term "peptide" is used interchangeably with "oligopeptide" in the present 
specification to designate a series of residues, typically L-amino acids, connected one to the 
other typically by peptide bonds between the alpha-amino and carbonyl groups of adjacent 
5 amino acids. The oligopeptides are generally less than 250 amino acids in length, and can be 
less than 150, 100, 75, 50, 25, or 15 amino acids in length. Further, an oligopeptide of the 
invention can be such that it does not comprise more than 15 contiguous amino acids of a native 
antigen. 

The nomenclature used to describe peptide compounds follows the conventional practice 
10 where the amino group is presented to the left (the N-terminus) and the carboxyl group to the 
right (the C-terminus) of each amino acid residue. In the formulae representing selected specific 
embodiments of the present invention, the amino- and carboxyl-terminal groups, although not 
specifically shown, are in the fonn they would assume at physiologic pH values, unless 
otherwise specified. In the amino acid structure formulae, each residue is generally represented 
1 5 by standard three letter or single letter designations. The L-form of an amino acid residue is 
represented by a capital single letter or a capital first letter of a three-letter symbol, and the D- 
form for those amino acids having D-forms is represented by a lower case single letter or a lower 
case three letter symbol. Glycine has no asynmietric carbon atom and is simply referred to as 
"Gly"orG. 

20 An "immunogenic peptide" or "epitope" is a peptide or amino acid sequence which 

comprises an allele-specific motif such that the peptide sequence will bind an MHC molecule 
and induce a CTL response. Immunogenic peptides of the invention are capable of binding to an 
appropriate HLA-A2 molecule and inducing a cytotoxic T-cell response against the antigen fi-om 
which the immunogenic peptide is derived. The immunogenic peptides of the invention are less 

25 than about 1 5 residues in length, often less than 12 residues in length and usually consist of 
between about 8 and about 1 1 residues, preferably 9 or 10 residues. 

Immunogenic peptides are conveniently identified using the algorithms of the invention. 
The algorithms are mathematical procedures that produce a score which enables the selection of 
immunogenic peptides. Typically one uses the algorithmic score with a "binding threshold" to 

30 enable selection of peptides that have a high probability of binding at a certain affinity and will 
in turn be immunogenic. The algorithm is based upon either the effects on MHC binding of a 
particular amino acid at a particular position of a peptide or the effects on binding of a particular 
substitution in a motif containing peptide. 
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Binding results are often expressed in terms of "ICso's/* IC50 is the concentration of 
peptide in a binding assay at which 50% inhibition of binding of a reference peptide is observed. 
Given the conditions in which the assays as described herein are run (i.e., limiting HLA proteins 
and labeled peptide concentrations), these values approximate Kd values. Assays for 
5 determining binding are described in detail in PCT publications WO 94/20127 and WO 
94/03205. It should be noted that IC50 values can change, often dramatically, if the assay 
conditions are varied, and depending on the particular reagents used HLA preparation, 
etc.y For example, excessive concentrations of HLA molecules will increase the apparent 
measured IC50 of a given Ugand and therefore not reflect the true Kd value. 

10 Binding is often expressed as a ratio relative to a reference peptide. As a particular assay 

becomes more, or less, sensitive, the ICso's of the peptides tested may change somewhat. 
However, the binding relative to the reference peptide will not significantly change. For 
example, in an assay run under conditions such that the IC50 of the reference peptide increases 
10-fold, the IC50 values of the test peptides will also shift approximately 10-fold. Therefore, to 

1 5 avoid ambiguities, the assessment of whether a peptide is a good, intermediate, weak, or negative 
binder is generally based on its ICso, relative to the IC50 of a standard peptide. The binding may 
be reported as a ratio or the ratio may be used to normalize the IC50 value as described in 
Example 1. 

As used herein, high affinity with respect to HLA class I molecules is defined as binding 
20 with an IC50 or Ko value of less than 50 nM. Intermediate afBnity is binding with an IC50 (or 
Kd) of between about 50 and about 500 nM. 

A "conserved residue" is an amino acid which occurs in a significantly higher fi-equency 
than would be expected by random distribution at a particular position in a peptide. Typically a 
conserved residue is one where the MHC structure may provide a contact point with the 
25 immunogenic peptide. One to three, preferably two, conserved residues within a peptide of 

defined length defines a motif for an immunogenic peptide. These residues are typically in close 
contact with the peptide binding groove, with their side chains buried in specific pockets of the 
groove itself. Typically, an immunogenic peptide will comprise up to three conserved residues, 
more usually two conserved residues. 
30 As used herein, "negative binding residues" are amino acids which if present at certain 

positions (for example, positions 1, 3 and/or 7 of a 9-mer) will result in a peptide being a 
nonbinder or poor binder and in turn fail to be immunogenic i.e. induce a CTL response. 

The term "motif refers to the pattem of residues in a peptide of defined length, usually 
about 8 to about 1 1 amino acids, which is recognized by a particular MHC allele. The peptide 
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motifs are typically different for each human MHC allele and differ in the pattern of the highly 
conserved residues and negative residues. 

The binding motif for an allele can be defined with increasing degrees of precision. In 
one case, all of the conserved residues are present in the correct positions in a peptide and there 
5 are no negative residues in positions 1,3 and/or 7. 

A "supermotif ' is a peptide binding specificity shared by HLA molecules encoded by two 
or more HLA alleles. A supermotif-bearing epitope preferably is recognized with high or 
intermediate affinity (as defined herein) by two or more HLA antigens. 

An "HLA supertype or family", as used herein, describes sets of HLA molecules grouped 
10 on the basis of shared peptide-binding specificities. HLA class I molecules that share somewhat 
similar binding affinity for peptides bearing ceilain amino acid m 
supertypes. The tenns HLA superfamily, HLA supertype family, and HLA xx-li^ 
molecules (where XX d^otes a particular HLA type) are synonym 

The phrases "isolated" or "biologically pure" refer to material which is substantially or 
15 essentially free from components which normally accompany it as found in its native state. 
Thus, the peptides of this invention do not contain materials normally associated with their in 
situ environment, e.g., MHC I molecules on antigen presenting cells. Eyen where a protein has 
been isolated to a homogenous or dominant band^ there are trace contaminants in the range of 5- 
1 0% of native protein which co-purify with the desired protein. Isolated peptides of this 
20 invention do not contain such endogenous co-purijFied protein. 

The term "residue" refers to an amino acid or amino acid mimetic incorporated in an 
oligopeptide by an amide bond or amide bond mimetic. 

Brief Description of the Drawings 

Figure 1 . Position 2 and G-terminus fine specificity of HLA-A*020 1 . The preference for 
25 specific residues in position 2(a) or at the C-terminus (b) is shown at a function of the percent of 
peptides bearing a specific residue that bind A*020 1 with IC50 of 500 nM or better. ARB values 
of peptides bearing specific residues in position 2 (a) or at the C-terminus (b) were calculated as 
described herein, and indexed relative to the residue with the highest binding capacity. The 
average (geometric) binding capacity of peptides with L in position 2 was 1991 nM. The 
30 average (geometric) binding capacity of peptides with V at the C-terminus was 2133 nM. 
Peptides included in the analysis had at least one tolerated anchor residue, as described in the 
text, at either position 2 or the C-terminus. 
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Figure 2 . Map of the A*0201 motif. Summary map of the A*0201 motif for 8-mer (b), 
lO-mer (c) and 1 1-mer (d) peptides. At secondary anchor positions, residues shown as preferred 
(or deleterious) are associated with an average binding capacity at least 3 -fold greater than (or 3- 
fold less than) peptides of the same size carrying other residues at the same position. At the 
5 primary anchor positions, preferred residues are those associated with an average binding 
capacity within 10-fold of the optimal residue at the same position. Tolerated primary anchor 
residues are those associated with an average binding capacity between 10- and 100-fold of the 
optimal residue at the same position. 

Figure 3 . Position 2 fine specificity of HLA-A2-supertype molecules. ARB values of 
10 peptides bearing specific residues in position 2 were calculated for each A2-supertype molecule 
as described in the text, and indexed relative to the residue with the highest ARB for each 
specific molecule. The average (geometric) binding capacity of the peptides bearing the residue 
with the highest ARB were 55, 59, 89, and 41nM for A*0202, A*0206, and A*6802, 
respectively. 

15 Figure 4 . C-terminal fine specificity of HLA-A2-supertype molecules. ARB values of 

pq)tides bearing specific residues at the C-terminus were calculated for each A2-supertype 
tnolecule as described in the text, and indexed relative to the residue with the hi^est ARB for 
each specific molecule. The average (geometric) binding capacity of the peptides bearing the 
residue with the highest ARB were 291, 48, 250, and 553 nM for A*0202, A*0203, A*0206, and 

20 A*6802, respectively. 

Figure 5 , Map of the A*0202 motif Summary map of A*0202 motif for 9-mer (a) and 
10-mer (b) peptides. At secondary anchor positions, residues shown as preferred (or deleterious) 
are associated with an average binding capacity at least 3-fold greater than (or 3-fold less than) 
peptides of the same size cairying other residues at the same position. At the primary anchor 

25 positions, preferred residues are those associated with an average binding capacity within 10-fold 
of the optimal residue at the same position. Tolerated primary anchor residues are those 
associated with an average binding capacity between 10- and 100-fold of the optimal residue at 
the same position. 

Figure 6 . Map of the A*0203 motif. Summary maps ofA*0203 motif for 9-mer (a) and 
30 lO-mer (b) peptides. At secondary anchor positions, residues shown as preferred (or deleterious) 
are associated with an average binding capacity at least 3-fold greater than (or 3-fold less than) 
peptides of the same size carrying other residues at the same position. At the primary anchor 
positions, preferred residues are those associated with an average binding capacity within 10-fold 
of the optimal residue at the same position. Tolerated primary anchor residues are those 
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associated with an average binding capacity between 10- and 100-fold of the optimal residue at 
the same position. 

Figure 7 . Map of the A*0206 motif. Summary maps of A*0206 motif for 9-mer (a) and 
1 0-mer (b) peptides. At secondary anchor positions, residues shown as preferred (or deleterious) 
5 are associated with an average binding capacity at least 3-fold greater than (or 3-fold less than) 
peptides of the same size carrying other residues at the same position. At the primary anchor 
positions, preferred residues are those associated with an average binding capacity within 10-fold 
of the optimal residue at the same position. Tolerated primary anchor residues jure those 
associated with an average binding capacity between 10- and 100-fold of the optimal residue at 

10 the same position. 

Figure 8 . Map of the A*6802 motif: Sununary maps of A*6802 motif for 9-mer (a) and 
10-mer (b) peptides. At secondary anchor positions, residues shown as preferred (or deleterious) 
are associated with an average binding capacity at least 3-foId greater than (or 3-fold less than) 
peptides of the same size carrying other residues at the same position. At the primary anchor 

15 positions, preferred residues are those associated with an average binding capacity within lO^fold 
of the optimal residue at the same position. Tolerated primary anchor residues are those 
associated with an ay er^ge binding capacity between 10- and 100- fold of the optimal residue at 
the same position. 

Figure 9 . A2 supermotif consensus sumniary of secondary and primary anchor 
20 influences on A2-supertype binding capacity of 9-(a) and 10-mer (b) peptides. Residues shown 
significantly influence binding to 3 or more A2-supertype molecules. The nimiber of molecules 
influenced are indicated in parentheses. At secondary anchor positions, residues are considered 
preferred only if they do not have a deleterious influence on more than one molecule. Preferred 
residues which were deleterious in the context of one molecule are indicated by reduced and 
25 italicized font. Assessment at the primary anchor positions are based on single substitution and 
peptide library analyses, as discussed in the text. 

Description of the Preferred Embodiments 

The present invention relates in part to an epitope-based approach for vaccine design. 
Such an approach is based on the well-established finding that the mechanism for inducing CTL 
30 immune response comprises the step of presenting a CTL epitope as a peptide of about 8-11 
amino acids bound to aii HLA molecule displayed on an antigen-presenting cell. The HLA 
molecule is the product of a class I MHC wherein the product is expressed on most nucleated 
cells. 
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The products of the MHC class I alleles are generically characterized as A« B and C HLA 
molecules. Within each of these categories, there is a multiplicity of allelic variants in the 
population; indeed, there are believed to be well over 500 class I and class n alleles. Since a 
cytotoxic T-cell response cannot be elicited unless the epitope is presented by the class I HLA 
5 contained on the surface of the cells of the individual to be immunized, it is important that the 
epitope be one that is capable of binding the HLA exhibited by that individual. 

The starting point, therefore, for the design of effective vaccines is to ensure that the 
vaccine will generate a large number of epitopes that can successfully be presented. It may be 
possible to administer the peptides representing the epitopes per se. Such administration is 
1 0 dependent on the presentation of "empty*' HLA molecules displayed on the cells of the subject. 
In one approach to use of the immunogenic peptides per je, these peptides may be incubated 
with antigen-presenting cells from the subject to be treated ex vivo and the cells then returned to 
the subject. 

Alternatively, the 8-11 amino acid pq}tide can be generated in situ by administering a 

IS nucleic acid containing a nucleotide sequence encoding it. Means for providing such nucleic 
acid molecules are described in WO 99/58658, the disclosure of which is incorporated herein by 
reference. Further, the immunogenic peptides can be administered as portions of a larger peptide 
molecule and cleaved to release the desired peptide. The larger peptide may contain extraneous 
amino acids, in general the fewer the better. Thus, peptides which contain such amino acids are 

20 typically 25 amino acids or less, more typically 20 amino acids or less, and more typically 

15 amino acids or less. The precursor may also be a heteropolymer or homopolymer containing 
a multiplicity of different or same CTL epitopes. Of course, mixtiu-es of peptides and nucleic 
acids which generate a variety of immunogenic peptides can also be employed. The design of 
the peptide vaccines, the nucleic acid molecules, or the hetero- or homo-polymers is dependent 

25 on the inclusion of the desired epitope. The present invention provides a paradigm for 
identifying the relevant epitope which is effective across the broad population range of 
individuals who are characterized by the A2 supertype. The following pages describe the 
methods and results of experiments for identification of the A2 supermotif. 

It is preferred that peptides include an epitope that binds to an HLA-A2 supertype allele. 

30 These motifs may be used to define T-cell epitopes from any desired antigen, particularly those 
associated with human viral diseases, cancel^ or autoimmune diseases, for which the amino acid 
sequence of the potential antigen or autoantigen targets is known. 

Epitopes on a number of potential target proteins can be identified based upon HLA 
binding motifs. Examples of suitable antigens include prostate specific antigen (PSA), hepatitis 
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B core and surface antigens (HB Vc, HBVs) hepatitis C antigens, Epstein-Barr virus antigens, 
melanoma antigens (e.g., MAGE-1), human immunodeficiency virus (HIV) antigens, human 
papilloma virus (HPV) antigens, p53, CEA, trypanosome surface antigen (TSA), and Her2/neu. 
Peptides comprising the epitopes from these antigens may be synthesized and then tested 
5 for their abiUty to bind to the appropriate MHC molecules in assays using, for example, purified 
class I molecules and radioiodonated peptides and/or cells expressing empty class I molecules 
by, for instance, immunofluorescent staining and flow microfluorometry, peptide-dependcnt 
class I assembly assays, and inhibition of CTL recognition by peptide competition. Those 
peptides that bind to the class I molecule may be further evaluated for their abiUty to serve as 

1 0 targets for CTLs derived from infected or immunized individuals, as well as for their capacity to 
induce primary in vitro or in vivo CTL responses that can give rise to GTL populations capable 
of reacting with virally infected target ceils or tumor cells as potential therapeutic agents. 

The MHC class I antigens are encoded by the HLA-A, B, and C loci. HLA-A and B 
antigens are expressed at the cell surface at ^proximately equal densities, whereas the 

15 expression of HLA-C is significantly lower (perhaps as much as 10-fold lower). Each of these 
loci have a number of alleles. The peptide binding motifs of the invention are relatively specific 
for each allelic subtype. 

For peptide-based vaccines, peptides preferably comprise a motif recognized by an MHC 
I molecule having a wide distribution in the human population, or comprise a motif recognized 

20 by a genetically diverse population. Since the MHC alleles occur at different frequencies within 
different ethnic groups and races, the choice of target MHC allele may depend upon the target 
population. Table 1 shows the frequency of various alleles at the HLA-A locus products among 
different races. For instance, the majority of the Caucasoid population can be covered by 
peptides which bind to four HLA-A allele subtypes, specifically HLA-A2.1, Al, A3.2, and 

25 A24. 1 . Similarly, the majority of the Asian population is encompassed with the addition of 
peptides binding to a fifth allele HLA-Al 1 .2. 
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TABLE I 



A Allele/Subtvoe 


Nf69)* 


A(54) 


a502> 


Al 


10.1(7) 


1.8(1) 


27.4(138) 


A2.1 


11.5(8) 


37.0(20) 


39.8(199) 


A2.2 


10.1(7) 


0 


3.3(17) 


A23 


1.4(1) 


5.5(3) 


0.8(4) 


A2.4 


- 


- 


- 


A2.5 


- 


- 


- 


A3.1 


1.4(1) 


0 


0.2(0) 


A3.2 


5.7(4) 


5.5(3) 


21.5(108) 


All.l 


0 


5.5(3) 


0 


A11.2 


5.7(4) 


31.4(17) 


8.7(44) 


A11.3 


0 


3.7(2) 


0 


A23 


4.3(3) 


- 


3.9(20) 


A24 


2.9(2) 


27.7(15) 


15.3(77) 


A24.2 


- 


- 


- 


A24.3 




- 


- 


A25 


1.4(1) 


- 


6.9(35) 


A26.1 


4.3(3) 


9.2(5) 


5.9(30) 


A26.2 


7.2(5) 


- 


1.0(5) 


A26V 


- 


3.7(2) 


- 


A28.1 


10.1(7) 


- 


1.6(8) 


A28.2 


1.4(1) 


- 


7.5(38) 


A29.1 


1.4(1) 


- 


1.4(7) 


A29.2 


10.1(7) 


1.8(1) 


5.3(27) 


A30.1 


8.6(6) 


- 


4.9(25) 


A30.2 


1.4(1) 


- 


0.2(1) 


A30.3 


7.2(5) 




3.9(20) 


A31 


4.3(3) 


7.4(4) 


6.9(35) 


A32 


2.8(2) 




7.1(36) 


Aw33.1 


8.6(6) 




2.5(13) 


Aw33.2 


2.8(2) 


16.6(9) 


1.2(6) 


Aw34,l 


1.4(1) 






Aw34.2 


14.5(10) 




0.8(4) 


Aw36 


5.9(4) 







Table compiled from B. DuPont, Immunobiology of HLA . Vol. I, Histocompatibility Testing 
1 987, Springer-Verlag, New York 1 989. 

* N = Negroid; A = Asian; C = Caucasoid. Numbers in 
parenthesis represent the number of individuals included in the analysis. 



Cross-reactive binding of HLA-A2.1 motif-bearing peptides with other HLA-A2 allele- 
specific molecules can occur. Those allele-specific molecules that share binding specificities 
with HLA;A2.l are deemed to comprise the HLA-A2.1 supertype. The B pocket of A2 
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supertype HLA molecules is characterized by a consensus motif including residues (this 
nomenclature uses single letter amino acid codes, where the subscript indicates peptide position) 

A24, M45, E/N63, K/N66, V67, H/Q70 and Y/G99. Similarly, the A2-supertype F pocket is 
characterized by a consensus motif including residues D77, Tgo, Lgi and Yne (155). About 66% 
5 of the peptides binding A*0201 will be cross-reactive amongst three or more A2-supertype 
alleles. 

The A2 supertype as defined herein is consistent with cross-reactivity data, (Fruci, D. et 
al. Hum, Immunol. 38:187, 1993), from live cell binding assays (del Guercio, M.-F. et al., X 
Immunol 1 54:685, 1 995) and data obtained by sequencing naturally processed peptides (Sudo, 

10 T., et aL, 7. Immunol 1 55:4749, 1 995) bound to HLA-A2 allele-specific molecules. 

Accordingly the family of HLA molecules (i.e, the HLA-A2 supertype that binds these peptides) 
is comprised of at least nine HLA-A proteins: A*0201 , A*0202, A*0203, A*0204, A*0205, 
A*0206, A*0207, A*6802, and A*6901 . 

As described herein, the HLA-A2 supermotif comprises peptide ligands with L, I, V, M, 

15 A, T, or Q as primary anchor residues at position 2 and L, I, V, M, A^ or T as a primary anchor 
residue at the C-terminal position of the epitope. HLA-A2 motifs that are most particularly 
relevant to the invention claimed here comprise V, A, T, or Q at position two and L, I, V, M, A, 
or T at the C-terminal anchor position. A peptide epitope comprising an HLA-A2 supermotif 
may bind more than one HLA-A2 supertype molecule. 

20 A procedure which may be used to identify peptides of the present invention is disclosed 

in Falk, et al. Nature 351 :290 (199 1), incorporated herein by reference. Briefly, the methods 
involve large-scale isolation of MHC class I molecules, typically by immunoprecipitation or 
affinity chromatography, from appropriate cell or cell line. Examples of other methods for 
isolation of the desired MHC molecule equally well known to the artisan include ion exchange 

25 chromatography, lectin chromatography, size exclusion, high performance ligand 
chromatography, and a combination of all of the above techniques. 

In a typical case, immunoprecipitation may be used to isolate the desired allele. A 
number of protocols can be used, depending upon the specificity of the antibodies used. For 
example, allele-specific mAb reagents can be used for the affinity purification of the HLA-A, 

30 HLA-B, and HLA-C molecules. Several mAb reagents for the isolation of HLA-A molecules are 
available. The monoclonal BB7.2 is suitable for isolating HLA-A2 molecules. Affinity colimms 
prepared with these mAbs using standard techniques are successfully used to purify the 
respective HLA-A allele products. 
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In addition to allele-specific mAbs, broadly reactive anti-HLA-A, B, C mAbs, such as 
W6/32 B9.12.1 and B 1. 23.2, could be used in alternative affinity purification protocols as 
described in the example section below. 

The peptides bound to the peptide binding groove of the isolated MHC molecules are 
5 eluted typically using acid treatment. Peptides can also be dissociated from class I molecules by 
a variety of standard denatxuing means, such as heat, pH, detergents, salts, chaotropic agents, or 
a combination thereof. 

Peptide fractions are further separated from the MHC molecules by reversed-phase high 
performance liquid chromatography (HPLC) and sequenced. Peptides can be separated by a 
10 variety of other standard means well known to the artisan, including filtration, ultrafiltration, 
electrophoresis, size chromatography, precipitation with specific antibodies, ion exchange 
chromatography, isoelectrofocusing, and the like. 

Sequencing of the isolated peptides can be performed according to standard techniques 
such as Edman degradation (Hunkapiller, M.W., et a/.. Methods EnzymoL 91, 399 [1983]). 
1 S Other methods suitable for sequencing include mass spectrometry sequencing of individual 
peptides as previously described (Hunt, et al. Science 225:1261 (1992), which is incorporated 
herein by reference). Amino acid sequencing of bulk heterogeneous peptides (e.g., pooled 
HPLC fractions) from different class I molecules typically reveals a characteristic sequence 
niptif for each class I allele. 
20 Definition of motifs specific for different class I alleles allows the identification of 

potential peptide epitopes from an antigenic protein whose amino acid sequence is known. 
Typically, identification of potential peptide epitopes is initially carried out using a computer to 
scan the amino acid sequence of a desired antigen for the presence of motifs. 

Following identification of motif-bearing epitopes, the epitopic sequences are then 
25 synthesized. The capacity to bind MHC Class molecules is measured in a variety of different 
ways. One means is a Class I molecule binding assay as described in the related applications, 
noted below. Other alternatives described in the literature include inhibition of antigen 
presentation (Sette, et aL, 1 Immunol 141:3893 (1991), in vitro assembly assays (Townsend, et 
ah. Cell 62:285 (1990), and FACS based assays using mutated cells, such as RMA.S (Melief, et 
30 al, Eur, J, Immunol 21 :2963 (1991)). 

As disclosed herein, higher HLA binding affinity is correlated with greater 
immunogenicity. Greater immuiiogenicity can be manifested in several different ways. 
Immunogenicity can correspond to whether an immune response is elicited at all, and to the 
vigor of any particular response, as well as to the extent of a diverse population in which a 

13 
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response is elicited. For example, a peptide might elicit an immune response in a diverse array 
of the population, yet in no instance produce a vigorous response. In accordance with the 
principles disclosed herein, close to 90% of high binding peptides have been found to be 
immunogenic, as contrasted v^ith about 50% of the peptides which bind with intermediate 
5 affinity. Moreover, higher binding affinity peptides lead to more vigorous inununogenic 
responses. As a result, less peptide is required to elicit a similar biological effect if a high 
affinity binding peptide is used. Thus, in preferred embodiments of the invention, high affinity 
binding epitopes are particularly useful. Nevertheless, substaiitial improvements over the prior 
art are achieved with intermediate or high binding peptides. 

10 The relationship between binding affinity for HLA class I molecules and immunogenicity 

^ of discrete peptide epitopes has been determined for the first time in the art by the present 
inventors. In these experiments, in which discrete peptides were referred to, it is to be noted that 
cellular processing of peptides in vivo will lead to such peptides even if longer fi^gnaents are 
used. Accordingly, longer peptides comprising one or more epitopes are within the scope of the 

1 5 invention. The correlation between binding affinity and inmiunogenicity was analyzed in two 
different experimental approaches (Sette, et al, J. Immunol 153:5586-5592, 1994). In the first 
approach, the iirununogenid^ potential epitopes ranging in HLA binding affinity over a 
10,000-fold range was analyzed in HLA-A*0201 transgenic mice. In the second approach, the 
antigenicity of approximately 100 different hepatitis B virus (HBV)-derived potential epitopes, 

20 all carrying A*0201 binding motifs, was assessed by using PEL (peripheral blood lymphocytes) 
fi*om acute hepatitis patients. Pursuant to these approaches, it was determined that an affinity 
threshold value of approximately 500 nM (preferably 50 nM or less) is correlated with the 
capacity of a peptide epitope to elicit a CTL response. These data are true for class I binding 
affinity measurements for naturally processed peptides and for synthesized T-cell epitopes. 

25 These data also indicate the important role of determinant selection in the shaping of T-cell 
responses (see, e.g., Schaeffer, et al, Proc. Natl Acad, Scl USA 86:4649-4653, 1989). 

Accordingly, CTL-inducing peptides preferably include those that have an IC50 for class I 
HLA molecules of 500 nM or less. In the case of motif-bearing peptide epitopes fix)m tumor 
associated antigens, a binding afifmity threshold of 200 nM has been shown to be associated with 

30 killing of tumor cells by resulting CTL populations. 

In a preferred embodiment, following assessment of binding acti vity for an HLA-A2 
allele-specific molecule, peptides exhibiting high or intermediate affinity are then considered for 
fiirther analysis. Selected peptides may be tested on other members of the supertype family. In 
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preferred embodiments, peptides that exhibit cross-reactive binding are then used in vaccines or 
in cellular screening analyses. 

For example, peptides that test positive in the HLA-A2 binding assay, /.e., that have 
binding affinity values of 500 nM or less, are assayed for the ability of the peptides to induce 
5 specific CTL responses in vitro. For instance, antigen-presenting cells that have been incubated 
with a peptide can be assayed for the ability to induce CTL responses in responder cell 
populations. Antigen-presenting cells can be normal cells such as peripheral blood mononuclear 
cells or dendritic cells (Inaba, et aL, J. Exp. Med. 166: 182 (1987); Boog, Eur. J. Immunol. 
18:219 [1988]). 

10 Alternatively, mutant mammalian cell lines that are deficient in their ability to load class 

I niQlecules with internally processed pq>tides, such as the mouse cell lines RMA-S (Karre, e/ 
fl/.. Nature, 319:675 (1986); Ljunggren, et aL, Eur. J. Immunol. 21:2963-2970 (1991)), and the 
human somatic T-cell hybrid, T-2 (Cerundolo. et aL, Nature 345:449-452 (1990)) and which 
have been transfected with the appropriate genes which encode human class I molecules are 

1 5 conveniently used, when peptide is exogenously added to them, to test for the capacity of the 
peptide to induce in vitro primary CTL responses. Other eukaryotic cell lines that can be used 
include various insect cell lines such as mosquito larvae (ATCC cell lines CCL 125, 126, 1660, 
1591, 6585, 6586), silkworm (ATTC CRL 8851), armyworm (ATCC CRL 171 1), moth (ATCC 
CCL 80) and Drosophila cell lines such as a Schneider cell line (see Schneider J. Embryol Exp. 

20 Morphol 27:353-365 [1927]). 

Peripheral blood lymphocytes are conveniently isolated following simple venipuncture or 
leukapheresis of normal donors or patients and used as the responder cell sources of CTL 
precursors. In one embodiment, the appropriate antigen-presenting cells are incubated with 10- 
100 liM of peptide in serum-free media for 4 hours under appropriate culture conditions. The 

25 peptide-loaded antigen-presenting cells are then incubated with the responder cell populations in 
vitro for 7 to 10 days under optimized culture conditions. Positive CTL activation can be 
determined by assaying the cultures for the presence of CTLs that kill radiolabeled target cells, 
both specific peptide-pulsed targets as well as target cells expressing endogenously processed 
form of the relevant virus or tumor antigen from which the peptide sequence was derived. 

30 Specificity and MHC restriction of the CTL is determined by testing against different 

peptide target cells expressing appropriate or inappropriate human MHC cliass L The peptides 
that test positive in the MHC binding assays and give rise to specific CTL responses are referred 
to herein as immunogenic peptides. 
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Kast, et al. {J. Immunol 152:3904-3912, 1994) have shown that motif-bearing peptides 
account for 90% of the epitopes that bind to allele-specific HLA class I molecules. In this study 
all possible peptides of 9 amino acids in length and overlapping by eight amino acids (240 
peptides), which cover the entire sequence of the E6 and E7 proteins of human papillomavirus 
5 type 16, were evaluated for binding to five allele-specific HLA molecules that are expressed at 
high frequency among different ethnic groups. This unbiased set of peptides allowed an 
evaluation of the predictive value of HLA class I motifs. From the set of 240 peptides, 22 
peptides were identified that bound to an allele-specific HLA molecules with high or 
intennediate affinity. Of these 22 peptides, 20, {i.e. 91%), were motif-bearing. Thus, this study 
10 demonstrated the value of motifs for the identification of peptide epitopes for inclusion in a 

vaccine: application of motif-based identification techniques eliminates screening of 90% of the 
potential epitopes. The quantity of available peptides, and the complexity of the screening 
process would make a comprehensive evaluation of an antigen highly difficult, if not impossible 
without use of motifs. 

IS An immunogenic peptide epitope of the invention may be included in a polyepitopic 

vaccine composition comprising additional peptide epitopes of the same antigen, antigens firom 
the same source, and/or antigens firom a different source. Moreover, class II epitopes can be 
. included along with class I epitopes. Peptide epitopes fi-om the same antigen may be adjacent 
epitopes that are contiguous in sequence or may be obtained ftom different regions of the 

20 protein. 

As noted in greater detail below, the immunogenic peptides can be prepared 
synthetically, such as by chemical synthesis or by recombinant DNA technology, or isolated 
fi-om natural sources such as whole viruses or tumors. Although the peptide will preferably be 
substantially fi-ee of other naturally occurring host cell proteins and fi-agments thereof, in some 
25 embodiments the peptides can be synthetically conjugated to native fi-agments or particles. 

The polypeptides or peptides can be a variety of lengths, either in their neutral 
(uncharged) forms or in forms which are salts, and either firee of modifications such as 
glycosylation, side chain oxidation, or phosphorylation or containing these modifications, subject 
to the condition that the modification not destroy the biological activity of the polypeptides as 
30 herein described. 

Desirably, the peptide will be as small as possible while still maintaining substantially all 
of the biological activity of the large peptide. When possible, it may be desirable to optimize 
peptide epitopes of the invention to a length of 9 or 10 amino acid residues, commensurate in 
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size with endogenously processed viral peptides or tumor cell peptides that are bound to MHC 
class I molecules on the cell surface. 

Peptides having the desired activity may be modified as necessary to provide certain 
desired attributes, e.g., improved phannacological characteristics, while increasing or at least 
5 retaining substantially all of the biological activity of the unmodified peptide to bind the desired 
MHC molecule and activate the appropriate T-cell. For instance, the peptides may be subject to 
various changes, such as substitutions, either conservative or non-conservative, where such 
changes might provide for certain advantages in their use, such as improved MHC binding. By 
conservative substitutions is meant replacing an amino acid residue with another which is 

10 biologically and/or chemically similar, e.g., one hydrophobic residue for another, or one polar 
residue for another. The substitutions include combinations such as Gly, Ala; Val, He, Leu, Met; 
Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. The effect of single amino acid 
substitutions may also be probed using D-amino acids. Such modifications may be made using 
well known peptide synthesis procedures, as described in e.g., Merrifield, Science 232:341-347 

IS (1986), Barany and Merrifield, The Peptides, Gross and Meienhofer, eds. (N.Y., Academic 

Press), pp. 1-284 (1979); and Stewart and Young, Solid Phase Peptide Synthesis, (Rockford, III., 
Pierce), 2d Ed, (1984). 

The peptides can also be modified by extending or decreasing the compound's amino acid 
sequence, e.g., by the addition or deletion of amino acids. The peptides or analogs of the 

20 invention can also be modified by altering the order or composition of certain residues, it being 
readily appreciated that certain amino acid residues essential for biological activity, e.g., those at 
critical contact sites or conserved residues, may generally not be altered without an adverse 
effect on biological activity. The non-critical amino acids need not be limited to those naturally 
occurring in proteins, such as L-o^-amino acids, but may include non-natxu:al amino acids as well, 

25 such as i3-7-6 -amino acids, as well as many derivatives of L-a-amino acids such as D-isomers of 
natural amino acids. 

Typically, a series of peptides with single amino acid substitutions are employed to 
determine the effect of electrostatic charge, hydrophobicity, etc. on binding. For instance, a 
series of positively charged (e.g., Lys or Arg) or negatively charged (e.g., Glu) amino acid 
30 substitutions are made along the length of the peptide revealing different patterns of sensitivity 
towards various MHC molecules and T-cell receptors. In addition, multiple substitutions using 
small, relatively neutral moieties such as Ala, Gly, Pro, or similar residues may be employed. 
The substitutions may produce multi-epitopic peptides which are homo-oligomers or hetero- 
oligomers. The number and types of residues which are substituted or added depend on the 
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spacing necessary between essential contact points and certain functional attributes which are 
sought (e.g., hydrophobicity versus hydrophilicity). Increased binding affinity for an MHC 
molecule or T-cell receptor may also be achieved by such substitutions, compared to the affinity 
of the parent peptide. In any event, such substitutions generally employ amino acid residues or 
5 other molecular fragments chosen to avoid, for example, steric and charge interference which 
might disrupt binding. 

Amino acid substitutions are typically of single residues. Substitutions, deletions, 
insertions or any combination thereof may be combined to arrive at a final peptide. 
Substitutional variants are those in which at least one residue of a peptide has been removed and 
10 a different residue inserted in its place. Such substitutions generally are made in accordance with 
the following Table 2 when it is desired to finely modulate the characteristics of the peptide. 



TABLE 2 



Original Residue 


Exemplarv Substitution 


Ala 


Ser 


Arg 


Lys, His 


Asn 


Gin 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro 


His 


Lys; Arg 


He 


Leu; Val 


Leu 


He; Val 


Lys 


Arg; His 


Met 


Leu; He 


Phe 


Tyr; Trp 


Ser 


Thr 


Thr 


Ser 


Tip 


Tyr; Phe 


Tyr 


Trp; Phe 


Val 


He; Leu 
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Substantial changes in function (e.g., affinity for MHC molecules or T-cell receptors) are 
made by selecting substitutions that are less conservative than those in Table 2, i.e., selecting 
residues that differ more significantly in their effect on maintaining (a) the structure of the 
5 peptide backbone in the area of the substitution, for example as a sheet or helical conformation, 
(b) the charge or hydrophobicity of the molecule at the target site or (c) the bulk of the side 
chain. The substitutions which in general are expected to produce the greatest changes in peptide 
properties will be those in which (a) hydrophilic residue, e.g. seryl» is substituted for (or by) a 
hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a residue having an 

10 electropositive side chain, e.g., lysl, arginyl, or histidyl, is substituted for (or by) an 

electronegative residue, e.g. glutamyl or aspartyl; or (c) a residue having a bulky side chain, e.g, 
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine. 

The peptides may also comprise isosteres of two or more residues in the immunogenic 
peptide. An isostere as defined here is a sequence of two or more residues that can be substituted 

15 for a second sequence because the steric conformation of the first sequence fits a binding site 
specific for the second sequence. The term specifically includes peptide backbone 
modifications well known to those skilled in the art. Such modifications include modifications 
of the amide nitrogen, the a-carbon, amide carbonyl, complete replacement of the amide bond, 
extensions, deletions or backbone crosslinks. See, generally, Spatola, Chemistrv and 

20 Biochemistry of Amino Acids, Peptides and Proteins, Vol. Vn (Weinstein ed., 1983). 

Modifications of peptides with various amino acid mimetics or unnatural amino acids are 
particularly useful in increasing the stability of the peptide in vivo. Stability can be assayed in a 
number of ways. For instance, peptidases and various biological media, such as human plasma 
and serum, have been used to test stability. See^ e.g., Verhoef, et aL, Eur, 7. DrugMetab, 

25 Pharmacokin. 11:291-302(1986). Halflife ofthe peptides ofthe present invention is 

conveniently determined using a 25% human serum (v/v) assay. The protocol is generally as 
follows. Pooled human serum (Type AB, non-heat inactivated) is delipidated by centrifiigation 
before use. The serum is then diluted to 25% with RPMI tissue culture media and used to test 
peptide stability. At predetermined time intervals a small amount of reaction solution is removed 

30 and added to either 6% aqueous trichloracetic acid or ethanol. The cloudy reaction sample is 
cooled (4®C) for 15 minutes and then spun to pellet the precipitated serum proteins. The 
presence of the peptides is then determined by reversed-phase HPLC using stability-specific 
chromatography conditions. 
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The peptides of the present invention or analogs thereof which have CTL stimulating 
activity may be modified to provide desired attributes other than improved serum half life. For 
instance, the ability of the peptides to induce CTL activity can be enhanced by linkage to a 
sequence which contains at least one epitope that is capable of inducing a T helper cell response. 
5 In some embodiments, the T helper peptide is one that is recognized by T helper cells in 

the majority of the population. This can be accomplished by selecting amino acid sequences that 
bind to many, most, or all of the MHC class II molecules. These are known as "loosely 
MHC-restricted" T helper sequences. Examples of amino acid sequences that are loosely 
MHC-restricted include sequences from antigens such as Tetanus toxin at positions 830-843 

1 0 (QYIKANSKFIGI're), Plasmodium falciparum circumsporozoite (CS) protein at positions 

378-398 (DIEKKIAKMEKASSVFNVVNS), and Streptococcus l8kD protein at positions 1-16 
(YGAVDSILGGVATYGAA). 

Alternatively, it is possible to prepare synthetic peptides capable of stimulating T helper 
lymphocytes, in a loosely MHC-restricted fashion, using amino acid sequences not found in 

1 5 nature. These synthetic compoxmds, called Pan-DR-binding epitopes or PADRE™ molecules 
(Epinmiune, San Diego, CA), are designed on the basis of their binding activity to most 

HLA-DR (human^^^N^ class JO molecules e.g., U^. Patent Number 5,736,142). 

Particularly preferred inamunogenic peptides/T helper conjugates are linked by a spacer 
mpleQule. The spacer is typically comprised of relatively small, neutral molecules, such as 

20 amino acids or amino acid mimetics, which are substantially uncharged under physiological 
conditions. The spacers are typically selected from, e.g., Ala, Gly, or other neutral spacers of 
nonpolar amino acids or neutral polar amino acids. It will be understood that the optionally 
present spacer need not be comprised of the same residues and thus may be a hetero- or homo- 
oligomer. When present, the spacer will usually be at least one or two residues, more usually 

25 three to six residues. Alternatively, the CTL peptide may be linked to the T helper peptide 
without a spacer. 

The immunogenic peptide may be linked to the T helper peptide, either directly or via a 
spacer, at the amino or carboxy tenninus of the CTL peptide. The amino terminus of either the 
immunogenic peptide or the T helper peptide may be acylated. Exemplary T helper peptides 
30 include tetanus toxoid 830-843, influenza 307-319, malaria circumsporozoite 382-398 and 378- 
389. 

In some embodiments it may be desirable to include in the pharmaceutical compositions 
of the invention at least one component which primes CTL. Lipids have been identified as 
agents capable of priming CTL in vivo against viral antigens. For example, palmitic acid 

20 
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residues can be attached to the alpha and epsilon amino groups of a Lys residue and then Hnked, 
e.g., via one or more linking residues such as Gly, Gly-Gly-, Ser, Ser-Ser, or the like, to an 
immunogenic peptide. The lipidated peptide can then be injected directly in a micellar form, 
incorporated into a liposome or emulsified in an adjuvant, e.g., incomplete Freund's adjuvant. In 
5 a preferred embodiment a particularly effective immunogen comprises palmitic acid attached to 
alpha and epsilon amino groups of Lys, which is attached via linkage, e.g., Ser-Ser, to the amino 
terminus of the immxmogenic peptide. 

As another example of lipid priming of CTL responses, E, coli lipoproteins, such as 
tripahnitoyl-S-glycerylcysteinlyseryl-serine (P3CSS) can be used to prime virus specific CTL 

10 when covalently attached to an appropriate peptide. See, Deres» et aL, Nature 342:561-564 
(1989). Peptides of the invention can be coupled to P3CSS, for example^ and the lipopeptide 
administered to an individual to specifically prime a CTL response to the target antigen. Further, 
as the induction of neutralizing antibodies can also be primed with P3CSS conjugated to a 
peptide which displays an appropriate epitope, the two compositions can be combined to more 

1 5 effectively elicit both humoral and cell-mediated responses to infection. 

In addition, additional amino acids can be added to the tennini of a peptide to provide for 
ease of linking peptides one to another, for coupling to a carrier support, or larger peptide, for 
modifying the physical or chemical properties of the peptide or oligopeptide, or the Hke. Amino 
acids such as tyrosine, cysteine, lysine, glutamic or aspartic acid, or the like, can be introduced at 

20 the C- or N-terminus of the peptide or oligopeptide. Modification at the C terminus in some 
cases may alter binding characteristics of the peptide. In addition, the peptide or oligopeptide 
sequences can differ from the natural sequence by being modified by terminal-NH2 acylation, 
e.g., by alkanoyl (C1-C20) or thioglycolyl acetylation, terminal-carboxyl amidation, e.g., 
ammonia, methylamine, etc. In sorne instances these modifications may provide sites for linking 

25 to a support or other molecule. 

The peptides of the invention can be prepared in a wide variety of ways. Because of their 
relatively short size, the peptides (discrete epitopes or polyepitopic peptides) can be synthesized 
in solution or on a solid support in accordance with conventional techniques. Various automatic 
synthesizers are commercially available and can be used in accordance with known protocols. 

30 SeCy for example,' Stewart and Yoxmg, Solid Phase Peptide Svnthesis. 2d. ed.. Pierce Chemical 
Co. (1984), supra. 

Alternatively, preparation of peptides of the invention can comprise use of recombinant 
DNA technology wherein a nucleotide sequeiice which encodes an immunogenic peptide of 
interest is inserted into an expression vector, transformed or transfected into an appropriate host 

21 
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cell and cultivated under conditions suitable for expression. These procedures are generally 
known in the art, as described generally in Sambrook, et ai. Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Press, Cold Spring Harbor, New York (1982), which is 
incorporated herein by reference. Thus, fusion proteins which comprise one or more peptide 
5 sequences of the invention can be used to present the appropriate T-cell epitope. 

As the coding sequence for peptides of the length contemplated herein can be synthesized 
by chemical techniques, for example, the phosphotriester method of Matteucci, et aL^ J. Am, 
Chem. Soc. 103:3185 (1981), modification can be made simply by substituting the appropriate 
base(s) for those encoding the native peptide sequence. The coding sequence can then be 

10 provided with appropriate linkers and ligated into expression vectors commonly available in the 
art, and the vectors used to transform suitable hosts to produce the desired fusion protein. A 
number of such vectors and suitable host systems are now available. For expression of the 
fusion proteins, the coding sequence will be provided with operably linked start and stop codons, 
promoter and terminator regions and usually a repHcation system to provide an expression vector 

IS for expression in the desired cellular host. For example, promoter sequences compatible with 
bactoial hosts are provided in plasmids containing convenient restriction sites for insertion of 
the desired coding sequence. The resulting expression vectors are transformed into suitable 
bacterial hosts. Of course, yeast or manmialian cell hosts may also be used, employing suitable 
vectors and control sequences. 

20 The peptides of the present invention and pharmaceutical and vaccine compositions 

thereof are useful for administration to mammals, particularly humans, to therapeutically treat 
and/or prevent infections and cancer. Examples of diseases which can be treated using the 
immunogenic peptides of the invention include prostate cancer, hepatitis B, hepatitis C, AIDS, 
renal carcinoma, cervical carcinoma, lymphoma, CMV infection and condlyloma acuminatum. 

25 For pharmaceutical compositions, the immunogenic peptides of the invention are often 

administered to an individual akeady suffering from cancer or infected with the virus of interest. 
Those in the incubation phase or the acute phase of infection can be treated with the 
immunogenic peptides separately or in conjunction with other treatments, as appropriate. In 
therapeutic applications, compositions are administered to a patient in an amount sufficient to 

30 eHcit an effective CTL response to the infectious disease agent or tumor antigen and to cure or at 
least partially arrest symptoms and/or complications. An amount adequate to accomplish this is 
defined as a "therapeutically effective dose" or "unit dose". Amounts effective for this use will 
depend on, e.g., the peptide composition, the manner of administration, the stage and severity of 
the disease being treated, the weight and general state of health of the patient, and the judgment 
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of the prescribing physician. Generally for humans the dose range for the initial immunization 
(that is for therapeutic or prophylactic administration) is from about 1 .0 fig to about 20,000 /xg of 
peptide for a 70 kg patient, preferably, 100 /ig 1 50 fig 200 fxg 250 fig 300 fig 400 fig 
or 500 fig -20,000 ;xg, followed by boosting dosages in the same dose range pursuant to a 
5 boosting regimen over weeks to months depending upon the patient's response and condition by 
measuring specific CTL activity in the patient's blood. In embodiments where recombinant 
nucleic acid administration is used, the administered material is titrated to achieve the 
appropriate therapeutic response. It must be kept in mind that the peptides and compositions of 
the present invention may generally be employed in serious disease states, that is, life- 

10 threatening or potentially life threatening situations. In such cases, in view of the minimization 
of extraneous substances in the compositions of the invention and , e,g. , the relative nontoxic 
nature of the peptides, it is possible and may be felt desirable by the treating physician to 
administer substantial excesses of these compositions. 

For therapeutic use, administration should begin at the first sign of infection or the 

15 detection or surgical removal of tumors or shortly after diagnosis m the case of acute infection. 
This is followed by boosting doses until at least symptoms are substantially abated and for a 
period thereafter. In chronic infection, loading doses followed by boosting doses may be 
required. 

Treatment of an infected individual with the compositions of the invention may hasten 
20 resolution of the infection in acutely infected individuals. For those individuals susceptible (or 
predisposed) to developing chronic infection the compositions are particularly useful in methods 
for preventing the evolution from acute to chronic infection. Where the susceptible individuals 
are identified prior to or during infection, for instance, as described herein, the composition can 
be targeted to them, minimizing need for administration to a larger population. 
25 The peptide compositions can also be used for the treatment of chronic infection and to 

stimulate the immune system to eliminate, e.g., virus-infected cells in carriers. It is important to 
provide an amount of immuno-potentiating peptide in a formulation and mode of administration 
sufficient to effectively stimulate a cytotoxic T-cell response. Thus, for treatment of chronic 
infection, inununizing doses followed by boosting doses at established intervals, e,g., from one 
30 to four weeks, may be required, possibly for a prolonged period of time to effectively immunize 
an individual. In the case of chronic infection, administration should continue until at least 
clinical symptoms or laboratory tests indicate that the infection has been eliminated or 
substantially abated and for a period thereafter. 
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The pharmaceutical compositions for therapeutic treatment are intended for parenteral, 
topical, oral or local administration. Peptides of the invention can be administered in a form of 
nucleic acids that encode the peptides. Preferably, the pharmaceutical compositions are 
administered parenterally, e.g., intravenously, subcutaneously, intradermally, or intramuscularly. 
5 Thus, the invention provides compositions for parenteral administration which comprise a 

solution of the immunogenic peptides dissolved or suspended in an acceptable carrier, preferably 
an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, buffered water, 0.8% 
saline, 0.3% glycine, hyaluronic acid and the like. These compositions may be sterilized by 
conventional, well known sterilization techniques, or may be sterile filtered. The resulting 

10 aqueous solutions may be packaged for use as is, or lyophilized, the lyophilized preparation 
being combined with a sterile solution prior to administration. The compositions may contain 
pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions, such as pH adjusting and buffering agents, tonicity adjusting agents, wetting agents 
and the like, for example, sodium acetate, sodium lactate, sodium chloride, potassium chloride^ 

IS calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of CTL stimulatory peptides of the invention in the pharmaceutical 
formulations can vary widely, i.e., from less than about 0.1%, usually at or at least about 2% to 
as much as 20% to 50% or more by weight, and will be selected primarily by fluid volumes, 
viscosities, etc., in accordance with the particular mode of administration selected. A human unit 

20 dose form of the peptide composition is typically included in a pharmaceutical composition that 
comprises a human unit dose of an acceptable carrier, preferably an aqueous carrier, and is 
administered in a volume of fluid that is known by those of skill in the art to be used for 
administration of such compositions to humans. 

The peptides of the invention may also be administered via liposomes, which serve to 

25 target the peptides to a particular tissue, such as lymphoid tissue, or targeted selectively to 
infected cells, as well as increase the half- life of the peptide composition. Liposomes include 
emulsions, foams, micelles, insoluble monolayers, liquid crystals, phospholipid dispersions, 
lamellar layers and the like. In these preparations the peptide to be delivered is incorporated as 
part of a liposome, alone or in conjunction with a molecule which binds to» e.g., a receptor 

30 prevalent among lymphoid cells, such as monoclonal antibodies which bind to the CD45 antigen, 
or with other therapeutic or inununogenic compositions. Thus, liposomes either filled or 
decorated with a desired peptide of the invention can be directed to the site of lymphoid cells, 
where the liposomes then deliver the selected therapeutic/immunogenic peptide compositions. 
Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
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generally include neutral and negatively charged phospholipids and a sterol, such as cholesterol. 
The selection of lipids is generally guided by consideration of, e.g., liposome size, acid lability 
and stability of the liposomes in the blood stream. A variety of methods are available for 
preparing liposomes, as described in, e.g., Szoka, et al, Ann, Rev, Biophys. Bioeng. 9:467 
5 (1980), US. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 5,019,369. 

For targeting to the immune cells, a ligand to be incorporated into the liposome can 
include, e.g., antibodies or fragments thereof specific for cell surface determinants of the desired 
immune system cells. A liposome suspension containing a peptide may be administered 
intravenously, locally, topically, etc, in a dose which varies according to, inter alia, the manner 

10 of administration, the peptide being delivered, and the stage of the disease being treated. 

For solid compositions, conventional nontoxic solid carriers may be used which include, 
for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharin, talcum, cellulose, glucose, sucrose, magnesium carbonate, and the like. For oral 
administration, a pharmaceutically acceptable nontoxic composition is formed by incorporating 

1 5 any of the normally employed excipients, such as those carriers previously listed, and generally 
10-95% of active ingredient, that is, one or more peptides of the invention, and more preferably 
at a concentration of 25%-75%. 

For aerosol administration, the immunogenic peptides are preferably supplied in finely 
divided form along with a surfactant and propellant. Typical percentages of peptides are 0.01%- 

20 20% by weight, preferably 1%-10%. The surfactant must, of course, be nontoxic, and preferably 
soluble in the propellant. Representative of such agents are the esters or partial esters of fatty 
acids containing from 6 to 22 carbon atoms, such as caproic, octanoic, lauric, palmitic, stearic, 
linoleic, linolenic, olesteric and oleic acids with an aliphatic polyhydric alcohol or its cyclic 
anhydride. Mixed esters, such as mixed or natural glycerides may be employed. The surfactant 

25 may constitute 0. 1 %-20% by weight of the composition, preferably 0.25-5%. The balance of the 
composition is ordinarily propellant. A carrier can also be included, as desired, as with, e.g., 
lecithin for intranasal delivery. 

Accordingly, an aspect the present invention is directed to vaccines which contain as an 
active ingredient an immunogenically effective amount of an immxmogenic peptide as described 

30 herein. The peptides may also be administered in the form of nucleic acids which encode 

peptides of the invention upon expression in the recipient. The peptide(s) may be introduced into 
a host, including humans, linked to its own carrier or as a homopolymer or heteropolymer of 
active peptide units. Such a polymer has the advantage of increased immunological reaction and, 
where different peptides are used to make up the polymer, the additional ability to induce 
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antibodies and/or CTLs that react with different antigenic determinants of the virus or tumor 
cells. Useful carriers are well known in the art, and include, e.g., thyroglobulin, albumins such 
as human serum albumin, tetanus toxoid, polyamino acids such as poly(lysine:glutamic acid), 
influenza, hepatitis B virus core protein, hepatitis B virus recombinant vaccine and the like. The 
5 vaccines can also contain a physiologically tolerable (acceptable) diluent such as water, 

phosphate buffered saline, or saline, and further typically include an adjuvant. Materials such as 
incomplete Freund's adjuvant, aluminum phosphate, aluminum hydroxide, or alum are materials 
well known in the art as adjuvants. And, as mentioned above, CTL responses can be primed by 
conjugating peptides of the invention to lipids, such as P3CSS. Upon immunization with a 

10 peptide composition as described herein, via injection, aerosol, oral, transdermal or oflier route, 
the immune system of the host responds to the vaccine by producing large amounts of CTLs 
specific for the desired antigen, and the host becomes at least partially immune to later infection, 
or resistant to developing chronic infection. 

In some instances it may be desirable to combine the peptide vaccines of the invention 

1 5 with vaccines which induce neutralizing antibody responses to the virus of interest, particularly 
to viral envelope antigens. 

For therapeutic or inununization purposes, peptides of the invention can be administered 
in the form of nucleic acids encoding one or more of the peptides of the invention. The nucleic 
acids can encode a peptide of the invention and optionally one or more additional molecules. A 

20 number of methods are conveniently used to deliver nucleic acids to a patient. For instance, 
nucleic acid can be delivered directly, as "naked DNA", This approach is described, for 
instance, in Wolff, et aL Science 247: 1465-1468 (1990) as well as U.S. Patent Nos. 5,580,859 
and 5,589,466. Nucleic acids can also be administered using ballistic delivery as described, for 
instance, in U.S. Patent No. 5,204,253. Particles comprised solely of DNA can be administered. 

25 Alternatively, DNA can be adhered to particles, such as gold particles. 

The nucleic acids can also be delivered complexed to cationic compounds, such as 
cationic lipids, Lipid-mediated gene delivery methods are described, for instance, in WO 
96/18372; WO 93/24640; Mannino & Gould-Fogerite, BioTechniques 6(7); 682-691 (1988); 
Rose U.S. Pat No. 5,279,833; WO 91/06309; and Feigner, et aL, Proc. Natl Acad. Scl USA 84: 

30 7413-7414(1987). 

The peptides of the invention can also be expressed by attenuated viral hosts, such as 
vaccinia or fowlpox. This approach involves the use of vaccinia virus as a vector to express 
nucleotide sequences that encode the peptides of the invention. Upon introduction into an 
acutely or chronically infected host or into a noninfected host* the recombinant vaccinia virus 
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expresses the immunogenic peptide, and thereby elicits a host CTL response. Vaccinia vectors 
and methods useful in immunization protocols are described in, e.g., U.S. Patent No. 4,722,848. 
Another vector is BCG (Bacille Calmette Guerin). BCG vectors are described, e.g., in Stover, et 
al {Nature 351 :456-460 (1991)). A wide variety of other vectors useful for therapeutic 
5 administration or immunization of the peptides of the invention, e.g.. Salmonella typhi vectors 
and the like, will be apparent to those skilled in the art from the description herein. 

A preferred means of administering nucleic acids encoding the peptides of the invention 
uses minigene constructs encoding multiple epitopes of the invention optionally together with 
other molecules. To create a DNA sequence encoding the selected CTL epitopes (mmigene) for 

10 expression in human cells, the amino acid sequences of the epitopes are, e,g., reverse translated. 
A human codon usage table is used to guide the codon choice for each amino acid. These 
epitope-encoding DNA sequences are directly adjoined, creating a molecule that encodes a 
continuous polypeptide sequence. Optionally, to optimize expression and/or immunogenicity, 
additional elements can be incorporated into the minigene design. Examples of amino acid 

IS sequence that could be reverse translated and included in the minigene sequence include; helper 
T lymphocyte epitopes, a leader (signal) sequence, and an endoplasmic reticulum retention 
signal. In addition, MHC presentation of CTL epitopes may be improved by including synthetic 
(e.g. poly-alanine) or naturally-occurring flanking sequences adjacent to the CTL epitopes. 

The minigene sequence is converted to DNA by assembling oligonucleotides that encode 

20 the plus and minus strands of the minigene. Overlapping oligonucleotides (30-1 00 bases long) 
are synthesized, phosphorylated, purified and annealed under appropriate conditions using well 
known techniques, he ends of the oligonucleotides are joined using T4 DNA ligase. This 
synthetic minigene, encoding the CTL epitope polypeptide, can then cloned into a desired 
expression vector. 

25 Standard regulatory sequences well known to those of skill in the art are generally 

included in the vector to ensure expression in the target cells. Several vector elements are 
required: a promoter with a down-stream cloning site for minigene insertion; a polyadenylation 
signal for efficient transcription termination; an E, coli origin of replication; and an E. coli 
selectable marker (e.g. ampicilhn or kanamycin resistance). Numerous promoters can be used 

30 for this purpose, e.g. , the human cytomegalovirus (hCMV) promoter. See, U.S. Patent Nos. 
5,580,859 and 5,589,466 for other suitable promoter sequences. 

Additional vector modifications may be desired to optimize minigene expression and 
immunogenicity. In some cases, introns are required for efficient gene expression, and one or 
more synthetic or naturally-occurring introns could be incorporated into the transcribed region of 
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the minigene. The inclusion of mRN A stabilization sequences can also be considered for 
increasing minigene expression. It has recently been proposed that immunostiraulatory 
sequences (ISSs or CpGs) play a role in the immunogenicity of DNA vaccines. These sequences 
could be included in the vector, outside the minigene coding sequence, if found to enhance 
5 immunogenicity. 

In some embodiments, a bicistronic expression vector, to allow production of the 
minigene-encoded epitopes and a second protein included to enhance or decrease 
immunogenicity can be used. Examples of proteins or polypeptides that could beneficially 
enhance the immune response if co-expressed include cytokines (e.g., IL2, IL12, GM-CSF), 

10 cytoldne-inducing molecules (e.g. LeEF) or costimulatory molecules. Moreover, if helper T 
lymphocyte (HTL) epitopes are employed, the HTL epitopes can be joined to intracellular 
targeting signals and expressed separately &om the CTL epitopes. This allows direction of the 
HTL epitopes to a cell compartment different than the CTL epitopes. This can facilitate more 
efficient entry of HTL epitopes into the MHC class 11 pathway, thereby facilitating and 

IS improving CTL induction. In contrast to CTL induction, specifically decreasing the immune 
response by co-expression of immunosuppressive molecules (e.g. TGF-jS) may be beneiScial in 
certain diseases. 

Once an expression vector is selected, the minigene is cloned into the polylinker region 
downstream of the promoter. This plasmid is transformed into an appropriate E. coli strain, and 

20 DNA is prepared using standard techniques. The orientation and DNA sequence of the 

minigene, as well as all other elements included in the vector, are confirmed using restriction 
mapping and DNA sequence analysis. Bacterial cells harboring the correct plasmid can be 
stored as a master cell bank and a working cell bank. 

Therapeutic quantities of plasmid DNA are produced, e.g., by fermentation in E, coli, 

25 followed by purification. Aliquots fi"om the working cell bank are used to inoculate fermentation 
medium (such as Terrific Broth), and grown to saturation in shaker flasks or a bioreactor 
according to well known techniques. Plasmid DNA can be purified using standard 
bioseparation technologies such as solid phase anion-exchange resins supplied by Quiagen. If 
required, supercoiled DNA can be isolated fi'om the open circular and linear forms using get 

30 electrophoresis or other methods. 

Purified plasmid DNA can be prepared for injection using a variety of formulations. The 
simplest of these is reconstitution of lyophilized DNA in sterile phosphate^buffer saline (PBS). 
A variety of methods have been described, and new techniques may become available. As noted 
above, nucleic acids are conveniently formulated with cationic lipids. In addition, glycolipids, 
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fiisogenic liposomes, peptides and compounds referred to collectively as protective, interactive, 
non-condensing (PINC) could also be complexed to purified plasraid DNA to influence variables 
such as stability, intramuscular dispersion, or trafficking to specific organs or cell types. 

Target cell sensitization can be used as a functional assay for expression and MHC class I 
5 presentation of minigene-encoded CTL epitopes. The plasmid DNA can be introduced into a 
mammalian cell line that is suitable as a target for standard CTL chromium release assays. The 
transfection method used will be dependent on the final formulation. Electroporation can be 
used for "naked" DNA, whereas cationic lipids allow direct in vitro transfection. A plasmid 
expressing green fluorescent protein (GFP) can be co-transfected to allow enrichment of 

10 transfected cells using fluorescence activated cell sorting (FACS). These cells are then 

chromium-5 1 labeled and used as target cells for epitope-specific GTL lines. Cytolysis, detected 
by 51Cr release, indicates production of MHC presentation of minigene-encoded CTL epitopes. 

In vivo immunogenicity is a second approach for functional testing of minigene DNA 
formulations. Transgenic mice expressing appropriate human MHC molecules can be 

1 5 immunized with the DNA product. The dose and route of administration are formulation 
dependent {e.g. TM for DNA in PBS, IP for lipid-complexed DNA). Twenty-one days after 
immunization, splenocytes are harvested and restimulated for 1 week in the presence of peptides 
encoding each epitope being tested. These effector cells (CTLs) are assayed for cytolysis of 
peptide-loaded, chromium-Sl labeled target cells using standard techniques. Lysis of target cells 

20 sensitized by MHC loading of peptides corresponding to minigene-encoded epitopes 
demonstrates DNA vaccine fimction for in vivo induction of CTLs. 

Transgenic animals of appropriate haplotypes may additionally provide a useful tools in 
optimizing the in vivo inmiunogenicity of minigene DNA. In addition, animals such as monkeys 
having conserved HLA molecules with cross reactivity to CTL epitopes recognized by human 

25 MHC molecules can be used to determine human immunogenicity of CTL epitopes (Bertoni, et 
al.. J. ImmunoL 161:4447-4455 (1998)). 

Such in vivo studies are required to address the variables crucial for vaccine development, 
which are not easily evaluated by in vitro assays, such as route of administration, vaccine 
formulation, tissue biodistribution, and involvement of primary and secondary lymphoid organs. 

30 Because of their simplicity and flexibility, HLA transgenic mice represent an attractive 

alternative, at least for initial vaccine development studies, compared to more cumbersome and 
expensive studies in higher animal species, such as nonhuman primates. 

Antigenic peptides can be used to elicit CTL ex vivo, as well. The resulting CTL, can be 
used to treat chronic infections viral or bacterial) or tumors in patients that do not respond 
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to Other conventional forms of therapy, or will not respond to a peptide vaccine approach of 
therapy. Ex vivo CTL responses to a particular pathogen (infectious agent or tumor antigen) are 
induced by incubating in tissue culture the patient's CTL precursor cells (CTLp) together with a 
source of antigen-presenting cells (APC) and the appropriate immimogenic peptide. After an 
5 appropriate incubation time (typically 1-4 weeks), in which the CTLp are activated and mature 
and expand into effector CTL, the cells are infused back into the patient, where they will destroy 
their specific target cell (an infected cell or a tumor cell). 

The peptides may also find use as diagnostic reagents. For example, a peptide of the 
invention may be used to determine the susceptibility of a particular individual to a treatment 

10 regimen which employs the peptide or related peptides, and thus may be helpful in modifying an 
existing treatment protocol or in determining a prognosis for an affected individual. 

For example, a peptide of the invention may be used in a tetramer staining assay to assess 
peripheral blood mononuclear cells for the presence of antigen-specific CTLs following 
exposure to a pathogen or immunogen. The HLA-tetrameric complex is used to directly 

15 visualize antigen-specific CTLs {see, e.g., Ogg, et ai Science 279:2103-2106, 1998; and Altman, 
et al Science 174:94-96, 1996) and determine the fi-equency of the antigen-specific CTL 
population in a sample of peripheral blood mononuclear cells. A tetramer reagent using a 
peptide of the invention may be generated as follows: A peptide that binds to an allele-specific 
HLA molecule or supertype molecule is refolded in the presence of the corresponding HLA 

20 heavy chain and P2-niicroglobulin to generate a trimolecular complex. The complex is 
biotinylated at the carboxyl terminal end of the heavy chain at a site that was previously 
engineered into the protein. Tetramer formation is then induced by the addition of streptavidin. 
By means of fluorescently labeled streptavidin, the tetramer can be used to stain antigen-specific 
cells. The cells may then be identified, for example, by flow cytometry. Such an analysis may 

25 be used for diagnostic or prognostic purposes. 

In addition, the peptides may also be used to predict which individuals will be at 
substantial risk for developing chronic infection. 

The present apphcation is related to U.S. Serial No. 08/589,108, filed 1/23/96 and now 
abandoned, and to U.S. Serial No. 08/205,713 filed 3/4/94, which is a continuation-in-part of 

30 U.S. Serial No. 08/1 59, 1 84, filed 1 1/29/93 and now abandoned, which is a continuation-in-part 
of U.S. Serial No. 08/073,205 filed 6/4/93 and now abandoned, which is a continuation-in-part of 
U.S. Serial No. 08/027,146 filed 3/5/93 and now abandoned. The application is also related to 
U.S. Serial No. 60/013,980 filed 3/21/96 and now abandoned, U.S. Serial No. 08/454,033 filed 
5/26/95, U.S. Serial No. 08/349,177 filed 12/2/94, and U.S. Serial No. 08/753,622 filed 1//27/96 
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Each of the above-referenced applications is incorporated herein by 



EXAMPLES 

Example 1: Peptides 

5 Peptides utilized were synthesized as previously described by Ruppert, J. et al., 

"Prominent Role of Secondary Anchor Residues in Peptide Binding to HLA-A2.1 Molecules/' 
Cell 74:929-937 (1993) or purchased as crude material from Chiron Mimotopes (Chiron Corp,, 
Australia). Synthesized peptides were typically purified to >95% homogeneity by reverse phase 
HPLC. Purity of synthesized peptides was determined using analytical reverse-phase HPLC and 
10 amino acid analysis, sequencing, and/or mass spectrometry. Lyophilized peptides were 
resuspended at 4-20 mg/ml in 100% DMSO, then diluted to required concentrations in PBS 
+0.05% (v/v) NP40 (Fluka Biochemika, Buchs, Switzerland). 

Example 2: MHC Purification 

15 the EBVtransfonned cell lines JY (A*0^ 

(A*0205), CLA (A*0206), KNE (A*0207), AP (A*0207), and AMAI (A*6802) were used as the 
primary source of MHC molecules. Single MHC allele transfected 721.221 liiies were also used 
as sources of A*0202 and A*0207. Cells were maintained in vitro by culture in RPMI 1640 
medium (Flow Laboratories, McLean, VA), supplemented with 2 mM L-glutamine (GIBCO, 

20 Grand Island, NY), 100 U (100 jig/ml) penicillin-streptomycin solution (GDSCO), and 10% heat- 
inactivated FCS (Hazelton Biologies). Large scale cultures were maintained in roller bottles. 
HLA molecules were purified fix>m cell lysates (Sidney, J., ei al,^ **The Measurement of 
MHC/Peptide Interactions by Gel Infiltration," Curr Prot Immunol 18,3.1-18.3.19 (1998)). 
Briefly, cells were lysed at a concentration of 10^ cells/ml in 50 mM Tris-HCL, pH 8.5, 

25 containing 1 % (v/v) NP-40 1 50 mM NaCl, 5 mM EDTA, and 2 mM PMSF. Lysates were then 
passed through 0.45 |jM filters, cleared of nuclei and debris by centrifugation at 10,000 x g for 
20 minutes and MHC molecules purified by monoclonal antibody-based affinity 
chromatography. 

For affinity purification, columns of inactivated Sepharose CL4B and Protein A 
30 Sepharose were used as pre-columns. Class I molecules were captured by repeated passage over 
Protein A Sepharose beads conjugated with the anti-HLA (A, B, C) antibody W6/32 (Sidney, J., 
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et aL, supra). HLA-A molecules were further purified from HLA-B and -C molecules by 
passage over a B 1.23. 2 column. After 2 to 4 passages the W6/32 column was washed with 10- 
column volumes of 10 mM Tris-HCL, pH8.0 with 1% (v/v) NP-40, 2-column volumes of PBS, 
and 2-colunm volumes of PBS containing 0.4% (w/v) n-octylglucoside. Class I molecules were 
5 eluted with 50 mM dimethylamine in 0.15 M NaCl containing 0.4% (w/v) n-octylglucoside, pH 
1 1.5.A 1/26 volume of 2.0 M Tris, pH 6.8, was added to the eluate to reduce the pH to -8.0. The 
elixate was then concentrated by centrifugation in Centriprep 30 concentrators at 2000 rpm 
(Amicon, Beverly, MA). Protein purity, concentration, and effectiveness of depletion steps were 
monitored by SDS-PAGE and BCA assay. 

10 

Examples: MHC-Peptide Binding Assavs 

Quantitative assays to measure the binding of peptides to soluble Class I molecules are 
based on the inhibition of binding of a radiolabeled standard peptide. These assays were 
performed as previously described (Sidney, J., et aL, supra.). Briefly, 1-10 nM of radiolabeled 

IS peptide was co-incubated at room temperature with 1 to 1 nM of purified MHC in the 

presence of 1 ^M human Pa-microglubulin (Scripps Laboratories, San piego, CA) and a cocktail 
of protease inhibitors. Following a two day incubation, the percent of MHC bound radioactivity 
was determined by size exclusion gel filtration chromatog^raphy using a TSK 2000 column. 
Alternatively, the percent of MHC boimd radioactivity was determined by capturing 

20 MHC/peptide complexes on W6/32 antibody coated plates, and determining boxmd cpm using 
the TopCount microscintillation counter (Packard Instrument Co., Meriden, CT) (Southwood, et 
aL, Epimmune Technical Report Epi 063-99). 

The radio labeled standard peptide utilized for the A*0201, A*0202, A*0203, A*0205, 
A*0206, and A*0207 assays was an F6 > Y analog of the HBV core 18-27 epitope (sequence 

25 FLPSDYFPSV). The average IC50 of this peptide for each molecule was 5.0, 4.3, 10, 4.3, 3.7, 
and 23 nM, respectively. A C4 > A analog of HBV pol 646 (sequence FTQAGYPAL), or 
MAGE 1 282 (sequence YVIKVSARV), was utilized as the label for the A*6802 assay. Their 
IC50S for A*6802 were 40 and 8 nM, respectively. 

In the case of competitive assays, the concentration of peptide yielding 50% inhibition of 

30 the binding of the radiolabeled peptide was calculated. Peptides were initially tested at one or 
two high doses. The ICso of peptides yielding positive inhibition were then determined in 
subsequent experiments, in which two to six further dilutions were tested. Under the conditions 
utilized, where [label]<[MHC] and IC50 ^ [MHC], the measured IC50 values are reasonable 
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approximations of the true Kd values. Each competitor peptide was tested in two to four 
independent experiments. As a positive control, the unlabeled version of die radiolabeled probe 
was also tested in each experiment. 



5 Example 4: Alternative Binding Assay 

Epstein-Barr virus (EB V)-transformed homozygous cell lines, fibroblasts, CER, or 72 1 .22 
transfectants were used as sources of HLA class I molecules. These cells were maintained in 
vitro by culture in RPMI 1 640 medium supplemented with 2mM L-glutamine (GIBCO, Grand 
Island, NY), 50|iM 2-ME, 100|ag/ml of streptomycin, lOOU/ml of penicillin (Irvine Scientific) 

to and 1 0% heat-inactivated PCS (Irvine Scientific, Santa Ana, CA). Cells were grown in 225-cm^ - 
tissue culture flasks or, for large-scale cultures, in roller bottle apparatuses. Cells were harvested 
by centrifugation at 1500 RPM using an IEC-CRU5000 centrifuge with a 259 rotor and washed 
three times with phosphate-buffered saline (PBS)(0.01 M PO4, 0.154 M NaCl pH 7.2). 
Cells were pelleted and stored at -70**C or treated with detergent lysing solution to 

15 prq)are detergent lysates. Cell lysates were prepared by the addition of stock detergent solution 
[1% OT-40 (Sig^^^ Renex 30 (Accurate Chem. Sci. Corp., Westbury, NY 1 1590), 150 mM 
NaCl, 50 mM Tris, pH 8.0] to the cell pellets (previously counted) at a ratio of 50-100 x 10^ cells 
per ml detergent solution. A cocktail of protease inhibitors was added to the premeasured 
volume of stock detergent solution immediately prior to the addition to the cell pellet. Addition 

20 of the protease inhibitor cocktail produced final concentrations of the following: 

phenylmethylsulfonyl fluoride (PMSF), 2 mM; aprotinin, 5 fig/ml; leupeptin, 10 /xg/ml; 
pepstatin, 10 /zg/ml; iodoacetamide, ICQ fiM; and EDTA, 3 ng/ml. Cell lysis was allowed to 
proceed at 4^C for 1 hour with periodic mixing. Routinely 5-10x10^ cells were lysed in 50-1 00 
ml of detergent solution. The lysate was clarified by centrifugation at 1 5,000 x g for 30 minutes 

25 at 4**C and subsequent passage of the supernatant fraction through a 0.2 fi filter unit (Nalgene). 

The HLA- A antigen purification was achieved usiiig affinity columns prepared with 
mAb-conjugated Sepharose beads. For antibody production, cells were grown in RPMI vnih 
10% PBS in large tissue culture flasks (Coming 25160-225). Antibodies were purified from 
clarified tissue culture medium by ammonium sulfate fractionation followed by affinity 

30 chromatography on protein-A-Sepharose (Sigma). Brieflyj saturated ammonium sulfate was 

added slowly with stirring to the tissue culture supernatant to 45% (volume to volume) overnight 
at 4**C to precipitate the immunoglobulins. The precipitated proteins were harvested by 
centrifugation at 10,000 x g for 30 minutes. The precipitate was then dissolved in a minimum 
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volume of PBS and transferred to dialysis tubing (Spectro/Por 2, Mol. wt. cutoff 12,000-14,000, 
Spectum Medical Ind.). Dialysis was against PBS (^0 times the protein solution volume) with 
4-6 changes of dialysis buffer over a 24-48 hour period at 4'^C. The dialyzed protein solution 
was clarified by centrifugation (10,000 x g for 30 minutes) and the pH of the solution adjusted to 
5 pH 8.0 with IN NaOH, Protein- A-Sepharose (Sigma) was hydrated according to the 

manufacturer's instructions, and a protein- A-Sepharose column was prepared. A column of 10 
ml bed volume typically binds 50- 1 00 mg of mouse IgG. 

The protein sample was loaded onto the protein- A-Sepharose column using a peristaltic 
pump for large loading volumes or by gravity for smaller volumes (<100 ml). The column was 

1 0 washed with several volumes of PBS, and the eluate was monitored at A280 in a 

spectrophotometer xmtil base line was reached. The bound antibody was eluted using 0.1 M 
citric acid at suitable pH (adjusted to the appropriate pH with IN NaOH). For mouse IgG-1 pH 
6.S was used for IgG2a pH 4.S was used and for IgG2b and IgG3 pH 3.0 was used. 2 M Tris 
base was used to neutralize the eluate. Fractions containing the antibody (monitored by A280) 

15 were pooled, dialyzed against PBS and further concentrated using an Amicori Stirred Cell system 
(Amicon Model 8050 with YM30 membrane). The ahti-A2 mAb, BB7.2, was useful for afHnity 

purification. 

The HLA-A antigen was purified using affinity columns prepared with mAb-conjugated 
Sepharose beads. The affinity columns were prepared by incubating protein- A-Sepharose beads 

20 (Sigma) with affinity-purified mAb as described above. Five to 10 mg of mAb per ml of bead is 
the preferred ratio. The mAb bound beads were washed with borate buffer (borate buffer: 1 00 
mM sodium tetraborate, 1 54 mM NaCl, pH 8.2) until the washes show A280 at based line. 
Dimethyl pimelimidate (20 mM) in 200 mM triethanolamine was added to covalently crosslink 
the bound mAb to the protein-A-Sepharose (Schneider, et al, J, Biol Chem, 257:10766 (1982). 

25 After incubation for 45 minutes at room temperature on a rotator, the excess crosslinking reagent 
was removed by washing the beads twice with 10-20 ml of 20 mM ethanolamine, pH 8.2. 
Between each one the slurry was placed on a rotator for 5 minutes at room temperature. The 
beads were washed with borate buffer and with PBS plus 0.02% sodium azide. 

The cell lysate (5-10 x 10^ cell equivalents) was then slowly passed over a 5-10 ml 

30 affinity column (flow rate of 0,1-0.25 ml per minute) to allow the binding of the antigen to the 
immobilized antibody. After the lysate was allowed to pass through the column, the column was 
washed sequentially with 20 column volumes of detergent stock solution plus 0.1 % sodium 
dodecyl sulfate, 20 column volumes of 0.5 M NaCl, 20 mM Tris, pH 8.0, and 10 column 
volumes of 20 mM Tris, pH 8.0. The HLA-A antigen bound to the mAb was eluted with a basic 
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buffer solution (50 mM diraethylamine in water). As an alternative, acid solutions such as 0.15- 
0.25 M acetic acid were also used to elute the bound antigen. An aliquot of the eiuate (1/50) was 
removed for protein quantification using either a colorimetric assay (BCA assay, Pierce) or by 
SDS-PAGE, or both. SDS-PAGE analysis was performed as described by Laemmli (Laemmli, 
5 U.K., Nature 227:680 (1970)) using known amounts of bovine serum albumin (Sigma) as a 
protein standard. Allele specific antibodies were used to purify the specific MHC molecule. In 
the case of HLA-A2, the mAb BB7.2 was used. 

A detailed description of the protocol utilized to measure the binding of peptides to Class 
I HLA molecules has been published (Sette, et ai, Mol Immunol. 31 :813, 1994; Sidney, et ai, in 

10 Current Protocols in Immunology, Margulies, Ed., John Wiley & Sons, New York, Section 1 8.3, 
1998). Briefly, purified MHC molecules (5 to 500nM) were incubated with various unlabeled 
peptide inhibitors and 1-lOnM '^^I-radiolabeled probe peptides for 48h in PBS containing 0.05% 
Nonidet P-40 (NP40) (or 20% w/v digitonin for H-2 lA assays) in the presence of a protease 
inhibitor cocktail. The final concentrations of protease inhibitors (each from CalBioChem, La 

1 5 JoUa, CA) were 1 mM PMSF, 1 .3 nM 1 . 1 0 phenanthroline, 73 |liM pepstatin A, 8mM EDTA, 
6mM N-ethyUnaleimide, and 200 |iM N alpha-p-tosyl-L-lysine chloromethyl ketone (TLCK). 
All assays were performed at pH 7.0. 

Following incubation, MHC-peptide complexes were separated firom free peptide by gel 
filtration on 7.8 mm x 15 cm TSK200 columns (TosoHaas 16215, Montgomeryville, PA), eluted 

20 at 1 .2 mls/min with PBS pH 6.5 containing 0.5% NP40 and 0. 1 % NaNa. The eiuate from the 
TSK columns was passed through a Beckman 170 radioisotope detector, and radioactivity was 
plotted and integrated using a Hewlett-Packard 3396 A integrator, and the fraction of peptide 
bound was determined. 

Radiolabeled peptides were iodinated using the chloramine-T method. A specific 

25 radiolabeled probe peptide was utilized in each assay. Typically, in preliminary experiments, 
each MHC preparation was titered in the presence of fixed amounts of radiolabeled peptides to 
determine the concentration of HLA molecules necessary to bind 1 0-20% of the total 
radioactivity. All subsequent inhibition and direct binding assays were performed using these 
HLA concentrations. 

30 Since under these conditions [label]<[HLA] and IC5o^[HLA], the measured IC50 values 

are reasonable approximations of the true Kd values. Peptide inhibitors are typically tested at 
concentrations ranging from 120 \xg/m\ to 1.2 ng/ml, and are tested in two to four completely 
independent experiments. To allow comparison of the data obtained in different experiments, a 
relative binding figure is calculated for each peptide by dividing the IC50 of a positive control for 
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inhibition, i.e. the reference peptide that is included in every binding assay, by the IC50 for each 
tested peptide (typically unlabeled versions of the radiolabeled probe peptide). For database 
purposes, and inter-experiment comparisons, relative binding values are compiled. These values 
can subsequently be converted into normalized IC50 nM values by dividing the standard 
5 historical IC50 of the reference peptide by the relative binding of the peptide of interest. This 
method of data compilation has proven to be the most accurate and consistent for comparing 
peptides that have been tested on different days, or v/ith different lots of purified MHC. 
For example, the standard reference peptide (or positive control) for the HLA-A2.1 binding 
assays described herein is the peptide having a sequence of FLPSDYFPSV, which has an 
10 average historical IC50 value of 5 nM in multiple, repeated bmding assays. This standard value is 
used to normalize reported IC50 values for HLA-A2.1 binding as described herein. Thus, a 
relative binding value of a test HLA-A2.1 motif-bearing peptide can be converted into a 
normalized IC50 by dividing the standard reference IC50 value, Le. 5 nM, by the relative binding 
value of the test HLA-A2. 1 motif-bearing peptide. 

15 Examples: Sequence and Binding Anal vsis 

Using the assay described in Example 3, a relative binding value was calculated for each 
peptide by dividing the IC50 of the positive control for inhibition by the IC50 for each test^ 
peptide. These values can subsequently be converted back into ICso nM values by dividing the 
IC50 nM of the positive controls for inhibition by iflie relative binding of the peptide of interest 

20 This method of data compilation has proved to be accurate and consistent for comparing peptides 
that have been tested on different days or with different lots of purified MHC. Standardized 
relative binding values also allow the calculation a geometric mean, or average relative binding 
value (ARB), for all peptides with a particular characteristic (Ruppert, J., et aL, "Prominent Role 
of Secondary Anchor Residues in Peptide Binding to HLA-A2. 1 Molecules," Cell 74:929-937 

25 (1993); Sidney, J., et al, "Definition of an HLA-A3-Like Supermotif Demonstrates the 

Overlapping Peptide Binding Repertoires of Common HLA Molecules," Hum ImmunoL 45:79- 
93 (1996); Sidney, J., et al, "Specificity and Degeneracy in Peptide Binding to HLA-B7-Like 
Class I Molecules," J. ImmunoL 157:3480-3490 (1996); Kondo, A., et al., "Prominent Roles of 
Secondary Anchor Residues in Peptide Binding to HLA-A24 Human Class I Molecules," J. 

30 Immunol 155:4307-4312 (1995); Kondo, A,, et a/., *Two Distinct HLA-A*0101-Specific 

Submotifs Illustrate Alternative Peptide Binding Modes," Immunogenetics 45:249-258 (1997); 
Gulukota, K., et aU *Two Complementary Methods for Predicting Peptides Binding Major 
Histocompatibility Complex Molecules," J. Mol Biol. 267:125^-1267 (1997); Southwood, S., et 
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a/., "Several Common HLA-DR Types Share Largely Overlapping Peptide Binding 
Repertoires," 7. Immunol 160:3363-3373 (1998)). 

Maps of secondary interactions influencing peptide binding to HLA-A2 supertype molecules 
based on ARB were derived as previously described (Ruppert, J. et al,, "Prominent Role of 
5 Secondary Anchor Residues in Peptide Binding to HLA-A2. 1 Molecules," Cell 74:929-937 
(1 993); Sidney, J., ei aL, "Definition of an HLA-A3-Like Supermotif Demonstrates the 
Overlapping Peptide Binding Repertoires of Common HLA Molecules," Hum ImmunoL 45:79- 
93 (1996); Sidney, J., et a/., "Specificity and Degeneracy in Peptide Binding to HLA-B7-Like 
Class I Molecules," y. Immunol 157:3480-3490 (1996); Kondo, A., et a/., "Prominent Roles of 

10 Secondary Anchor Residues in Peptide Binding to HLA-A24 Human Class I Molecules," J. 
ImmunoL 155:4307-4312 (1995); Kondo, A., et a/., "Two Distinct HLA-A*0 1 01 -Specific 
Submotifs Illustrate Alternative Peptide Binding Modes Immunogenetics 45:249-258 (1997); 
Gulukota, K., et al, *Two Complementary Methods for Predicting Peptides Binding Major 
Histocompatibility Complex Molecules," 7. Mol Biol 267:1258-1267 (1997)). Essentially, all 

15 peptides of a given size (8, 9, 10 or 1 1 amino acids) and with at least one tolerated main anchor 
residue were selected for analysis. The binding capacity of peptides in each size group was 
analyzed by determining the ARB values for peptides that contain specific amino acid residues in 
specific positions. For determination of the specificity at main anchor positions ARB values were 
standardized relative to the ARB of peptides carrying the residue associated with the best 

20 binding. For secondary anchor determinations, ARB values were standardized relative to the 
ARB of the whole peptide set considered. That is, for example, an ARB value was determined 
for all 9-mer peptides that contain A in position 1 , or F in position 7, etc. Because of the rare 
occurrence of certain amino acids, for some analyses residues were grouped according to 
individual chemical similarities as previously described (Ruppert, J. et al, supra; Sidney, J., et 

25 al, supra; Sidney, J., et al, supra; Kondo, A., et aly supra; Kondo, A., et al, supra; Gulukota, 
K., et al, supra; Southwood, S., et al^ supra). 

Frequencies of HLA-A2-SupertvDe Molecules 

To select a panel of A2-supertype molecules representative of the allelic forms most 
fi-equent in major ethnic groups, unpublished population typing data fi-om D. Mann and M. 
30 Femandez-Vina were utilized. These data were consistent with published data (Sudo, T., et al, 
"DNA Typmg for HLA Class I Alleles: I. Subsets of HLA-A2 and of -A28," Hum. Immunol 
33:163-173 (1992); Ellis, J.M., et a/., "Frequencies of HLA-A2 alleles in Five US Population 
Groups," Hum. Immunol 61 :334-340 (2000); Krausa, P., et al, "Genetic Polymoqjhism Within 
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HLA-A*02: Significant Allelic Variation Revealed in Different Populations," Tissue Antigens 
45:233-231 (1995) and Imanishi, T., et al, "Allele and Haplotype Frequencies for HLA and 
Complement Loci in Various Ethnic Groups*' Tsuji, K., et aL, (eds): HLA 1991, Proceedings of 
the Eleventh International Histo-Compatibility Workshop and Conference, Vol. 1 Oxford 
5 University Press, Oxford, pp. 1 065-1 220 (1992)), and are shown in Table 3. For the four major 
ethnic groups considered, it was apparent that seven HLA alleles represent the vast majority of 
A2 supertype alleles. Included in this group are A*0201, A*0202, A*0203, A*0205, A*0206, 
A*0207, and A*6802. Each of these alleles is present in 2% or more of the general population, 
and also occur with a frequency greater than 5% in at least one major ethnicity. Other alleles are 
10 represented with only minor frequencies of 1.3%, or less, in any one major ethnic group. 
Furthermore, none of the minor alleles are present with a frequency greater than 1 % in the 
general population. Based on these observations, A*0201, A*0202, A*0203, A*0205, A*0206, 
A*0207, and A*6802 were selected for studies defining peptide binding specificity and cross- 
reactivity in the A2-supertype. 

15 Main Anchor Positions of A2 Supertvpe Molecules 

Previous studies indicated a largely overlapping peptide binding specificity for a set of 
Class I molecules designated as the A2-supertype. Here, the main peptide binding specificity of 
A2-supertype molecules was examined in more detail. Some of these results have been 
published previously, and are shown here only for reference purposes (Ruppert, J , e/ aL, supra 

20 and Sidney, J., et al., *The HLA-A*0207 Peptide Binding Repertoire is Limited to a Subset of 
the A*0201 Rq)etoire," Hum, Immunol, 58:12-20 (1997)). 

In a first series of studies, non-conservative lysine (K) substitutions were 
introduced at every position of two peptides previously noted to bind multiple A2-supeTtype 
molecules: 1) the HCV NS3 590 9-mer peptide (sequence YLVAYQATV), and 2) the HBV core 

25 1 8 F6 > Y 10-mer analog peptide (sequence FLPSDYFPSV). These peptides were tested for 
their capacity to bind A*0201 , A*0202, A*0203, A*0205, A*0206, A*0207 and A*6802. In 
Tables 4a and 4b, binding capacities are expressed as ratios relative to the parent peptide. 
Peptides whose binding capacities are within 10-fold of the best binder are considered preferred; 
those whose relative binding capacities are 10-100-fold less than the best binder are considered 

30 tolerated. A dash ("-") indicates a relative binding of less than 0.01. In the case of the HCV NS3 
590 peptide (Table 4a), K substitutions at position 2 and the C-terminus resulted in greater than 
100-fold reduction in binding to each HLA molecule. Greater than 100-fold decreases in binding 
were also noted ui the context of A*6802 when K was substituted in positions 1 and 5. 
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Reductions in binding capacity in the 10- 1 00-fold range were noted when substitutions were 
made at several other positions, notably positions 3 and 7. When the 10-mer HBV core 18 F6>Y 
ligand (Table 4b) was investigated, greater than 100-fold reductions in binding capacity were 
again noted when the peptide was substituted at position 2 and the C-terminus. Significant 
S reductions in binding were also observed following substitution at position 7. 

Together, these data suggest that A2-supertype molecules bind both 9- and 10-mer 
peptide ligands via anchor residues in position 2 and at the C-terminus. The presence of an 
additional primary or secondary anchor towards the middle of the peptide is demonstrated by the 
fact that the binding of both the 9-mer and 10-mer peptides was usually reduced by substitutions 
10 at position 7. 



TABLE 3 

Phenotypic fi'equencies of A2-supertype alleles in four major ethnic groups 



Phenotypic frequency 



Allele 


Blacks 


Caucasians 


Orientals 


Hispanics 


Average 


A*020l 


22J 


45.6 


18.1 


37.1 


30.8 


A*6802 


12.7 


1.8 


0.0 


4.2 


4.7 


A*02()6 


OJO 


0.4 


9.3 


6.3 


4.0 


A*0207 


0.0 


0.0 


11.0 


0.0 


2.7 


A*6265 


w 


IS 


0.3 


3.0 


2.5 


A*0203 


0.0 


0.0 


8.8 


0.0 


2.2 


A*02G2 


6.4 


0.0 


0.5 


1.3 


2.0 


A*6901 


0.0 


0.7 


0.3 


1.3 


0.6 


A*021l 


0.0 


0.0 


0.0 


1.3 


0.3 


A*0212 


0.0 


0.0 


0.3 


0.8 


0.3 


A*0213 


0.0 


0.0 


0.0 


0.4 


0.1 


A*0214 


0.0 


0.0 


0.0 


0.0 


0.0 


Total 


43.1 


48.2 


45.0 


51.9 


47.1 
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TABLE 4a 
HCVNS3 590 



Sequence Relative binding capacity 



123456789 


A*0201 


A*0202 


A*0203 


A*0205 


A*0206 


A*6802 


YLVAYQATV 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


K 


0.40 


0.050 


0.31 


0.19 


0.29 


- 


K 


- 


- 


- 


- 


- 


- 


K 


0.53 


0.093 


0.60 


0.63 


0.064 


0.022 


K 


0.36 


0.19 


0.44 


1.0 


0.41 


0.17 


K 


0.17 


0.026 


0.30 


0.23 


0.16 




K 


0.54 


0.033 


0.27 


0.24 


0.10 


0.060 


K 


0.054 


0.016 


0.32 


0.14 


0.065 


0.043 


K 


0.24 


0.13 


0.37 


0.79 


0.14 


0.13 


K 
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TABLE 4b 
HBVcorel8F6>Y 

Sequence Relative binding capacity 



1 2 3 


4 S 6 7 8 9 10 


A»020l 


A*0202 


A*0203 


A»0205 


A*0206 


A*0207 


A*6802 


F L P 


S D Y F P S V 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


K 




0.43 


0.75 


0.72 


0.36 


1.7 


0.24 


- 


K 




- 


- 


- 


- 


- 


- 


- 


K 




0,44 


0.39 


13 


0.27 


0.17 


- 


0.22 




K 


0.95 


0.82 


3.4 


0.61 


1.3 


1.3 


0.43 




K 


0.60 


0.75 


12 


0.60 


0.76 


0.85 


0.77 




K 


0.58 


0.70 


6.8 


0.40 


0.39 


1.8 


1.6 




K 






0.079 






0.027 






K 


0.25 


0.22 


6.1 


0.076 


0.29 


0.25 


0.092 




K 


0.14 


0.18 


0.21 


0.18 


0.25 


0.14 ^ 


0.42 




K 

















Specificity of the Position 2 and C-Terminal Anchor Residues 

Based on these results, the ligand specificity of A2-supertype molecules at position 2 and 
5 the C-terminus was analyzed using additional HCV NS3 S90 and HBV core 18 F6>Y single 
substitution analogs, and also single substitution analogs of a poly-alanifie peptide (peptide 
953.01; sequence ALAKAAAAV). For these analyses, preferred amino acids for anchor 
residues were defined as those associated with a binding capacity within 10-fold of the optimal 
residue. Amino acids whose relative binding capacity was between 0.01 and 0.1 were defined as 

10 tolerated, and those associated with a binding capacity less than 0.01 were considered as non- 
tolerated. In the accompanying tables, a dash ("-") indicates a relative binding of less than 0.01 . 
Binding capacities are expressed as ratios relative to the related analog with the highest binding 
affinity for each individual molecule. 

At position 2 small aliphatic and hydrophobic residues were found to be generally 

15 tolerated, while other residues, including large polar, aromatic, and charged residues were 
typically not well tolerated (Tables 5a, 5b, and 5c). L, I, V, and M were preferred as anchor 
residues in most (>80%) contexts (Table 5d). The allele/peptide combinations in Table 5d refer 
to the number of instances in which a given residue was associated with a relative binding in the 
1-0.1 range (preferred) or 0.1-0.01 range (tolerated). A, T, Q, and S were less frequently 

20 preferred as anchor 
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HCVNS3 590 



Relative binding capacity 











A*0205 


A* 0206 


A*6802 


V 


0.28 


0,28 


0.23 


0.36 


0.53 


0.69 


T 


0.58 


0.24 


0.11 


0.34 


0.74 


1.0 


L 


1.0 


1.0 


1.0 


0.19 


1.0 


0.029 


1 


0.49 


0.24 


0.64 


0.24 


0.8 i 


0.045 


Q 


0.91 


0.55 


0.46 


1.0 


0.57 




P 














K 














F 


0.016 




0.012 








D 















TABLE Sb 
HBVcore 18F6>Y 

Relative binding capacity 



Residue 


A*0201 


A*0202 


A*0203 


A*0205 


A*0206 


A*0207 


A*6802 


I 


0.18 


0.66 


0.41 


0.82 


1.0 


0,31 


0.53 


L 


1.0 


0.46 


1.0 


0.79 


0.36 


1.0 


0.088 


V 


0.065 


1.0 


0.10 


1.0 


0.60 


0.10 


0.91 




0.013 . 


0.35 . 


-0.025 


0.25 


0.11 




1.0 


Q 


0.26 


0.049 


0.49 


0.074 


0.15 


0.053 




F 






0.015 








0.046 


D 
















K 
















P 
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TABLE 5c 

Poly-alanine peptide ALAKAAAAV 

Relative binding capacity 



Ives I Que 








A*0205 


A*0206 


A*6802 


L 


1.0 


0.92 


0.22 


0,11 


0.49 


o.ou 


M 


0.43 


0.73 


0.70 


0.43 


0.51 


0.010 


V 


0.051 


1.0 


0.40 


1.0 


1.0 


0.68 


I 


0.063 


0.56 


1.0 


0.16 


0.44 


0.073 


T 


0.025 


0.75 


0.091 


0.20 


0.35 


1.0 


A 


0.013 


0.26 


0.070 


0.089 


0.075 


0.31 


S 




0.12 


0.023 


0.011 


0.025 


0.057 


G 




0.031 


0.011 




0.017 




P 












0.016 


C 














D 














F 














K 














N 















TABLE 5d 
Summary 

AUele/Peptide combinations 



Residue 


Tested 


Preferred 


Tolerated 


%preferred 


%tolerated 
or preferred 


V 


19 


17 


2 


89.5 


100.0 


L 


19 


16 


3 


84.2 


100.0 


1 


19 


16 


3 


84.2 


100.0 


M 


6 


5 


1 


83.3 


100.0 


T 


19 


14 


4 


73.7 


94.7 


A 


6 


2 


4 


33.3 


100.0 


Q 


13 


8 


3 


61.5 


84.6 


S 


6 


1 


4 


16.7 


83.3 


G 


6 


0 


3 


0.0 


50.0 


F 


19 


0 


4 


0.0 


21.1 


P 


19 


0 


1 


0.0 


5.3 


C 


6 


0 


0 


0.0 


0.0 


K 


19 


0 


0 


0.0 


0.0 


N 


6 


0 


0 


0.0 


0.0 


D 


19 


0 


0 


0.0 


0.0 



43 



wo 02/061435 PCTAJS02/02708 

residues, but were either preferred or tolerated in >80% of the contexts examined. None of the 
other amino acids examined were preferred in any context and only rarely tolerated. 

At the C-terminus, V was found to be the optimal residue in the context of all 3 parent 
peptides for A*0201, A*0206, and A*6802, and in 2 out of 3 cases for A*0203 and A*0205 
5 (Tables 6a, 6b, and 6c). Overall, either V or L was the optimal C-terminal residue for each 
molecule, regardless of the peptide tested. The allele/peptide combinations in Table 6d refer to 
the number of instances in which a given residue was associated with a relative binding in the 1- 
0. 1 range (preferred) or 0. 1-0.01 range (tolerated). The aliphatic/hydrophobic amino acids V, L, 
and I were preferred as anchor residues in greater than 66.7% of the MHC-peptide contexts. M, 
10 A, and T were tolerated approximately 50% of the time. Other residues examined were either 
not accepted at all, or were tolerated only rarely. 

A Re-Evaluation of the Peptide Binding Specificity of A*0201 

The fine specificity of A*020l binding was investigated in more detail using a database 
of over 4000 peptides between 8- and 1 1 -residues in length. It was found that over 30% of the 
1 5 peptides bearing L or M in position 2 bound A*0201 with affinities of 500 nM, or better (Figure 
la). Between 5 and 15% of the peptides bearing the aliphatic residues I, V, A, T, and Q bound 
with ICsos of 500 nM, or better. No other residue, including aromatic (F, W, and Y), charged (R, 
H, K, D, and E), polar (S and N) and small (C, G, and P) residues, was associated with ICsos of 
500 nM, or better. 

20 Consistent with the single substitution analysis, V was found to be the optimal A'^0201 

C-temiinal anchor residue (Figure lb). Overall, 31.9% of the peptides with V at the C-terminus 
were A*0201 binders. I, L, S, C, M, T and A were also tolerated, with 7.1 to 28.6% of the 
peptides binding with an IC50 of 500 nM, or better. 

The correlation between peptide length (between 8 and 1 1 residues) and binding capacity 
25 was analyzed next. It was found that 27.6% of the 9-mer peptides bound with IC50 of 500 nM, or 
less, in good agreement with previous estimates (Ruppert, J., et al., supra) (Tabic 7a). ARB 
values are standardized to the peptide set of optimal size and shown for reference purposes. 

Longer peptides were also capable of binding, although somewhat less well; 17.8% of 
10-mer, and 14.5% of the 1 1-mer peptides had afiSnities of 500 nM or better. Finally, it was 
30 noted that 8-mer peptides bound A*0201 only rarely, with 3.5% of the peptides having binding 
capacities better than 500 nM. 

The A*020l peptide binding database was further analyzed to assess the stringency of the 
A*0201 motif As expected, peptides with preferred residues in each anchor position bound 
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TABLE 6a 
HCVNS3 590 

Relative binding capacity 



xvesiQuc 


t\ \J£,\J 1 






A*0205 


A*0206 


A*6802 


V 


1.0 


0.83 


1.0 


0.51 


1.0 


1.0 


I 


0.22 


0.14 


0.60 


0.30 


0.17 


A ATC 

U.075 


L 


0.95 


1.0 


0.72 


LO 


0.38 


0.062 


T 


0.16 


0.012 


0.11 


0.017 


0.034 




F 


0.066 




0.044 








D 














K 














P 














Q 















TABLE 6b 
HBVcorel8F6>Y 

Relative binding capacity 



Residue 


A*0201 


A*0202 


A*0203 


A*0205 


A*0206 


A*0207 


A*6802 


I 


0.21 


0.70 


0.15 


0.19 


0.26 


0.15 


0.39 


V 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


L 


0.18 


0.43 


0.23 


0.26 


0.077 


0.23 


0.087 


T 


0.033 


0.045 


0.027 


0.022 


0.10 


0.027 




P 


0.023 








0.012 


0.010 




D 
















F 
















K 
















Q 
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TABLE 6c 

Poly-alanine peptide ALAKAAAAV 

Relative binding capacity 

















1 


CIS 


0.29 


0.37 


0.11 


0.10 


0.38 


V 


1.0 


0.73 


0.20 


1.0 


1.0 


1.0 


L 


0.040 


1.0 


1.0 


0.36 


0.085 


0.26 


M 


0.025 


0.18 


0.031 


0.049 


0.034 


- 


A 


0.072 


- 


0.077 


- 


- 


0,025 


S 


- 


- 


0,011 


- 


- 


- 


T 






0.043 








C 














F 














G 














N 














P 














R 














Y 















TABLE 6d 
Summary ' 

Allele/Peptide combinations 



Residue 


Tested 


Preferred 


Tolerated 


%pFeferred 


% tolerated 
or preferred 


V 


19 


19 


0 


100.0 


100.0 


I 


19 


18 


1 


93.3 


100.0 


L 


19 


14 


5 


66.7 


100.0 


M 


6 


I 


4 


20.0 


83J 


T 


19 


3 


9 


20.0 


63.2 


A 


6 


0 


3 


0.0 


50.0 


S 


6 


0 


1 


0.0 


16.7 


P 


19 


0 


3 


0.0 


15.8 


F 


19 


0 


2 


0.0 


10.5 


C 


6 


0 


0 


0.0 


0.0 


G 


6 


0 


0 


0.0 


0.0 


N 


6 


0 


0 


0.0 


0.0 


R 


6 


0 


0 


0.0 


0.0 


K 


13 


0 


0 


0.0 


0.0 


Y 


6 


0 


0 


0.0 


0.0 


D 


13 


0 


0 


0.0 


0.0 


Q 


13 


0 


0 


0.0 


0.0 
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TABLE 7a 



Binding as a function of peptide size 



Peptide length 


(n) 




% Binding peptides 


ARB 


8 


171 




3.5 


0.072 


9 


2066 




27.6 


1.0 


10 


1451 




17.8 


0.27 


11 


179 




14.5 


0.20 


Total 


3867 




22.2 




TABLE 7b 










Binding as a function of main anchor motifs 


• • • - - 




Motif 




%Binding 




Position 2 


C-tenninus 


(n) 


peptides 


ARB 


Prefored 


Preferred 


526 


48.7 


1.0 


^fored 


Tolerated 


1446 


28.4 


0.31 


Tolerated 


Preferred 


558 


17.6 


0.098 


non-tolerated 


Preferred 


27 


0.0 . 


0,031 .. . . 


Preferred 


non-tolerated 


66 


6.1 


0.026 


Tolerated 


Tolerated 


1337 


?-l 


0.026 


non-tolerated 


Tolerated 


46 


0.0 


0.015 


non-tolerated 


non-tolerated 


71 


0.0 


0.014 


Tolerated 


non-tolerated 


105 


0.0 


0.013 


Total 




4182 


20.7 





most frequently (48.7%), and with higher average relative binding capacity than other peptides in 
the library (Table 7b). Peptides with one preferred residue and one tolerated residue also bound 
relatively frequently, in the 17.6 to 28.4% range. Finally, peptides with at least one non-tolerated 
S residue, or with tolerated residues at both inain anchor positions, bound only rarely, if at all, with 
frequencies of binding in the 0-7.1% range. No significant difference was detected in terms of 
primary anchor preferences as a function of ligand size. 

To identify secondary anchor effects, the A*0201 binding capacity of peptides in each 
size group was further analyzed by determining the ARB values for peptides that contain a 
1 0 particular amino acid residue in a specific, but size dependent, position. The resulting ARB 

values, by corresponding residue/position pairs, for 8-1 1-mer sequences are shown in Tables 8a- 
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8d. All of the peptides in Tables 8a-8d had at least 1 preferred and 1 tolerated residue at the 
main anchor positions. At secondary anchor positions values corresponding to a 3-fold or 
greater increase in binding capacity are indicated by increased and bolded font. Negative effects, 
associated with a three- fold decrease in binding affinity, are identified by underlined and 
5 italicized font. Also, residues determined to be preferred or tolerated anchors are indicated by 
bold font. ARB values at the anchor positions were derived from the analyses described in 
Figure 1 . To allow use of the values shown in this table as coefficients for predictive algorithms, 
the values for non-tolerated anchor residues have been set to 0.001, equivalent to a 1000-fold 
reduction in binding capacity, to filter out non-motif peptides. 

10 In Tables 8a, 8b, 8c, and 8d, the results of the analysis of a panel of 93 S-mer peptides, 

1389 9-mer peptides, 953 l O-mer peptides, and 95 are based dn"^ - 

naturally occurring sequences from various viral, bacterial, or pathogen origin. ARB values 
shown were calculated, for example, as described in Sidney et at. , Human Immunotogy 62: 1 200 
(2001) and Sidney et aL, J, Immunology 157: 3480 (1996). For 97mef and lO-mer peptides ARB 

15 values were derived for each residue considered individually. For studies of Srmer and 1 1-mer 
peptides (Tables 8a and 8d, respectively,) ARB values were based on the grouping of chemicaUy 
sirml^^^^ 7^: 929 (1993), The average geqme^^^^^ 

binding capacity of the 8-mer, 9-mer, 10-mer, and ll-mer panels was 14420 nM, 1581 nM, 3155 
nM^ and 3793 nM, respectively. 

20 Summary maps are shown in Figures 2a-2d. In most positions, some secondary influence 

could be detected. The majority (55%) of the negative influences involved the presence of acidic 
(D and E) or basic (R, H, and K) residues. Proline (P) and large polar residues (Q, and N) were 
also frequently disruptive. While each particular size was associated with unique preferences, in 
most instances (79%) preferred residues were aromatic (F, W, or Y) or hydrophobic (L, I, V, or 

25 M). Most peptide lengths exhibited a preference for F, Y and M in position 3. Similarly, all 
peptide sizes shared a preference for aromatic or hydrophobic residues in the C-2 position. 
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TABLE 8a 
8-iner peptides 

Position (ARB) 



Residue 


1 


2 


3 


4 


5 . . 


6 


7 


8 


A 


0.47 


0.052 


2.0 


0.57 


1.8 


8.9 


0.83 


0.28 


C 


1.3 


0 0010 


0.70 


1.3 


0.59 


2.3 


1.1 


0.0010 


D 


0.23 


0.0010 


0.42 


0.43 


0.34 


0,43 


0.50 


0.0010 


. E ■ 


0.23 


p.obio 


0,42 


0.43 


0.34 


0.43 


0.50 


0;00ig 


F 


2.5 


0.0016 


1.4 


U 


0.27 


3.4 


1.2 


0.0010 


G 


1.5 


0^0010. 


17 


1.8 


2.7 


0.38 


4.8 


0.0010 


H 


,0.95 


0.0010 


0.30 


0.54 


0.61 


0.40 


0:55 


aooio- 


I 


2.4 


047 


1.4 


2.0 


9.9 


1.5 


1.0 


0.35 


K 


0.95 


0.0010 


0.30 


0.54 


0.61 


0.40 


0.55 


0.0010 


L 


2.4 


1.0 


1.4 


2.0 


9,9 


1.5 


i.o 


0.34 


M 


2:4 


0J3 


1.4 


2.0 


9.9 


1.5 


1.0 


0.13 


N 


0.90 


0.0010 


1.0 


0.51 


0.38 


0.38 


0.66 


0^0010 






0.0010 


0.38 


. .. P.4p . 


... P,J5 


0.50 


..,...3.4_^ 


0.0010 


Q 


0.90 


0.076 


1.0 


0.51 


0.38 


0.38 


0.66 


0.0010 




^ 0,95 


o.opio 


0.30 


0.54 


P-61 


0.40 


J.55 


0.0010 


s. 


1.3 


0.0010 


0.70 


1.3 


0.59 


2.3 


1.1 


0:0010 


T 


1.3 


0.075 


0.70 


1.3 


0.59 


2.3 


l.l 


0.11 


V 


2.4 


0.084 


1.4 


2.0 


9.9 


1.5 


1.0 


1.0 


W 


2.5 


0.0010 


1-4 


1.3 


0.27 


3.4 


1.2 


0.0010 


Y 


2.5 


0.0010 


1.4 


1.3 


0.2T 


3.4 


1.2 


0.0010 
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TABLE 8b 
9-mer peptides 

Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


1.8 


0.052 


1.2 


2.3 


1.9 


0.45 


2.3 


0.80 


0.28 


C 


0.70 


0.0010 


0.57 


2.7 


1.4 


2.1 


0.86 


1.2 


0.0010 


D 


0.065 


0.0010 


1.2 


1.7 


0.84 


0.52 


0.21 


0.34 


0.0010 


£ 


0.065 


0.0010 


0.14 


1.5 


0.31 


0.58 


0.32 


1.4 


0.0010 


F 


9.1 


0.0010 


4.4 


1.1 


2.4 


2.6 


6.8 


4.1 


0.0010 


G 


0.84 


o.oaio . 


0.58 


L6 


0.69 


0.43 


0.28 


0.79 


0.0010 


H 


0.68 


0.0010 


0.79 


0.83 


3.8 


0.26 


1.7 


1.3 


0.0010 


I 


1.3 


0.17 


1.8 


0.56 


2.1 


2.0 


1.5 


0.45 


0.35 


K 


1.5 


0.0010 


0.14 


0.56 


0.57 


0.17 


0.19 


0.46 


0.0010 


L 


1.9 


1.0 


2.2 


0.70 


1.3 


2.6 


2.9 


2.1 


0.34 


M 


1.4 


0.73 


4.6 


0.20 


0.97 


1.5 


1.0 


0.30 


0.13 


N 


1.1 


0.0010 


0.78 


0.52 


0.32 


0.90 


0.47 


0.47 


0.0010 


P 


0.074 


0.0010 


0.64 


0.62 


0.47 


0.89 


1.6 


1.6 


0.0010 


Q 


0.33 


0.076 


1.2 


0.74 


1.0 


0.83 


0.62 


0.78 


0;0010 


R 


1.6 


0.00 io 


0.13 


0.47 


0.47 


0.17 


0.17 


0.49 


oioolo 


■ S 


0.99 


0.0010 


0.65 


1.2 


0.45 


0.97 


0.51 


2.0 


0.0010 


T 


0.60 


0.075 


0.53 


2.1 


0.59 


1.9 


0.98 


1.3 


0.11 


V 


0.93 


0.084 


1.2 


0.56 


1.7 


2.7 


0.75 


0.30 


1.0 


W 


0.58 


0.0010 


25 


5.1 


2.7 


1.3 


7.6 


1.9 


0.0010 


Y 


10 


0.0010 


4.3 


0.52 


3.2 


1.0 


7.4 


1.7 


0.0010 
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lO^mer peptides 

Position (ARB) 



Residue 


1 


2 


3 


. 4 


5 


6 


7 




9 


10 


A 


1.3 


0.052 


1:7 


1.6 


1.4 


1.1 


0.62 


1.2 


1.0 


0.28 


c 


0.63 


0.0010 


1.3 


1.3 


1.8 


0.51 


1.3 


2.6 


1.2 


0.0010 


D 


0.12 


0.0010 


0.85 


1.4 


1.1 


1.1 


0.39. 


0.22 


0.38 


0.0010 


E 


0.1 J 


0.0010 


0.17 


2.8 


0.28 


0.75 


0.43 


0.40 


0.92 


0.0010 


E; ■• 


4.4 


0.0010 


4.1 


1.4 


3.2 


2.3 


3.0 


5.0 


5.3 


0.0010 


G 


1.5 


0.0010 


0.44 


2.1 


0.91 


0.91 


0.81 


0.67 


1.1 


0.0010 


H _ 


0.54 


O.OOlO 


0.90 


0.76 


1.2 


0.42 


,.._0.74_ 


1.6 


0.52 


0.0010 


I 


1.4 


0.17 


3.1 


0.67 


2.4 


1,6 


2.7 


1.5 


0.57 


0.35 


K 


1.8 


0.0010 


0.13 


0.44 


0.26 


0.39 


0.48 


0.22 


0.47 


0.0010 


L 


1.9 


1.0 


3.6 


1.2 


1.3 


1.3 


4.5 


2.5 


1,2 


034 


M 


1.4 


0.73 


9.8 


1.1 


0.58 


1.7 


2.2 


4.6 


0.38 


0.13 


N 


0.58 


0.0010 


0.56 


1.4 


0.39 


1.1 


0.43 


0.33 


0.79 


0.0010 


P 


0.11 


0.0010 


0.53 


0.66 


0.40 


0.92 


0.86 


1.7 


0.85 


0.0010 




0^30 


0.076 


0.97 


0.30 


1.7 


0.48 


0.41 


6.32 


0.70 


o.obi6 


R 


1.1 


0.00 Id 


0.19 


0.35 


0.33 


0.77 


0.27 


0.17 


0.38 


0100 10 






aooio 


0.38 


0.60 


043 


0.58 


0.49 


0.87 






T 


0,83 


0.075 


0.44 


1.1 


1.6 


0.89 


1.0 


0.49 


1.2 


0.11 


V 


1.2 


0^084 


0.96 


0.54 


2.0 


2.2 


1.1 


L8 


1.4 


1.0 


w 


0.71 


0.0010 


1.8 


4.2 


3.5 


11 


2.6 


4.8 


1.5 


0.0010 


Y 


9.0 


0.0010 


7.4 


0.74 


0.67 


0,52 


2.0 


2.7 


2,0 


0.00 10 
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TABLE 8d 
1 1-mer peptides 

Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


A 


0.34 


0.052 


1.8 


2.7 


2.4 


2.2 


1.0 


0.23 


0.074 


1.3 


0.28 


C 


2.2 


0.0010 


0.17 


0.21 


0.98 


1.4 


1.9 


0.63 


0.79 


1.4 


0.0010 


D 


0.21 


0.0010 


0.40 


12 


0.94 


0.30 


0.21 


0.25 


0.28 


1.5 


0.0010 


E 


0.21 


0.0010 


0.40 


12 


0.94 


0.30 


0.21 


0.25 


0.28 


1.5 


0.0010 


F 


1.2 


0.0010 


6.1 


0.40 


2.6 


Oil 


1.4 


8.8 


6.1 


0.17 


0.0010 


G 


3.3 


0.0010 


0.13 


1.0 


0.30 


14 


21 


5J 


0.76 


9.0 


0.0010 


:.,.„.„,«....;.... 


12 


--AOOL0... 


0.42-^ 


,0.58,- 


0.12 


0.088 


1.4 


0.51 


0.16 




0.0010 


I 


4.4 


0.17 


9.2 


1-4 


2.4 


3.7 


0.87 


2,1 


5.5 


0.83 


0.35 


K 


12 


0.0010 


0.42 


0.58 


0.12 


0:088 


1.4 


0.51 


0.16 


0.55 


O.OOlO 


L 


4.4 


1.0 


9.2 


1.4 


2.4 


3.7 


0.87 


2.1 


5.5 


0.83 


0.34 


M 


4.4 


0.73 


9.2 


1.4 


2.4 


3.7 


0.87 


2.1 


5.5 


0,83 


0.13 


' N ■ 


0.12 


O.OOI0 


0.0P2 


1.7 


0.57 


1.3 


0.19 


I;6 


1.1 


0.27 


0,0010 


P 


0.056 


0.0010 


1.7 


0.38 


1.4 


0.13 


0.35 


1-1 


0 088 


12 


o;ooio 




0.12 


0.076 


0.092 


1.7 


0.57 


1.3 


0.19 


1.6 


1.1 


0.27 " 


bjobio 


R 


12 


0.0010 


0.42 


0.58 


0.12 


0.088 


■ 1.4 


0.51 




0.33 


0.0010 




■■■2.2 


0.0010 




6.27 


6W 


'""■i.r'- 




o.(si 






'''^;^ir"^^^ 


t 


2.2 


0.075 


0.17 


0.21 


0.98 


1.4 


1.9 


0.63 


0.79 


1.4 


0.11 


V 


4.4 


0.084 


9,2 


1.4 


2.4 


3.7 


0.87 


2.1 


5.5 


0.83 


1.0 


w 


1.2 


0.0010 


6.1 


0.40 


2.6 


on 


1.4 


8.8 


6.1 


0.17 


0.0010 


Y 


1.2 


0.0010 


6.1 


0.40 


2.6 


0.11 


1.4 


8.8 


6.1 


0.17 


0.0010 ■ 



Several distinct preference patterns were also observed for peptides of a given size. For 
example, 8-mer peptides did not have any preference in either position 1 or position 3 for the 
hydrophobic or aromatic residues preferred by 9-, 10-, and 1 1-mer peptides. 1 1-mer peptides 
were unique in the preference for G in multiple positions throughout the middle of the peptide, 

5 Main Anchor Specificities of Other A2-Supertvpe Molecules 

In the next set of analyses, the niain anchor specificities of A^0202, A*0203, A*0206, 
and A*6802j four of the most prevalent A2-supertype alleles next to A*0201, was assessed. 
Peptides in the A2-supertype binding database often reflect selection using an A*0201-based 
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bias, such as the selection of only A*0201 binding peptides, or the selection of peptides scoring 
high in A*0201 algorithms. As a result, in most cases, peptide binding data for non-A'^0201 
molecules is available for only peptides with supeitype preferred and tolerated residues. Despite 
this limitation, a database of about 400 peptides was available for study. A database of sufficient 
5 size was not available to allow analysis of A*0205 and A*0207, although an analysis of the 
specificity of A*0207 has been published previously (Sidney, J., et, al, supra). 

Analyses of the position 2 specificities are summarized in Figure 3a-d. In general, V, T, 
A, I, and M were tolerated in the context of each molecule. Allele specific preferences were also 
noted. In the case of A* 0202 Q was the most preferred residue. Other residues (L, I, V, A, T 

10 and M) were tolerated, and were roughly equivalent, with ARB in the 0.08-0.30 range. By 

contrast, A*0203 had a preference for L, M and Q. .Residues V, A, I and T were associated with 

lower overall binding affinities. A third pattem was noted for A*0206, where Q, V, I, A, and T 
were all well tolerated with ARB values between 0.47 and 1.0, while L and M were less well 
tolerated. Finally, for A*6802 V and T were the optimal residues, with ARB >0.4S. A was also 

15 preferred, but with a lower ARB (0.13). Significant decreases in binding were seen with I and 
M, which had ARB between 0.050 and 0.020. L and Q were not tolerated, with ARB <0.010. 
At the C-terminus, I, V, L, A, M and T were tolerated by all A2-supertype molecules tested, with 
ARB >0.66d (Figure 4a-d). I and V were tiie two residues most preferred by each allele; V was 
the optimal residue for A*0203, A*0206, and A*6802. L was typically the next most preferred 

20 residue, f , A, and M were usually associated with lower ARB values. 

In conclusion, the position 2 and C4erminal anchor residues preferred or tolerated by 
A*0201 were also well tolerated by other A2-supertype molecules. While each allele had a 
somewhat unique pattem of preferences at position 2, the patterns of preferences exhibited by 
each allele at the C4erminus were fairly similar. 

25 Secondarv Influences on Peptide Bin(Hng to A2-Supertvpe Molecules 

The same library of peptide ligands was analyzed to determiiie the ligand size preferences 
of A*0202, A*0203, A*0206, and A*6802. Fore each allele, ARB values are standardized to the 
peptide set of optimal size. We found that for each molecule 9-1 1 raer peptides were well 
tolerated, with ARB >0.36 (Table 9 a-d). For A*0203, A*0206, and A*6802, 9-mer peptides 
30 were optimal, but lO-mers were optimal in the case of A*0202. For all alleles, 8-mer peptides 
were much less well tolerated, with ARB in each case < 0. 1 1 . 
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TABLE 9a A*0202 . 



Peptide length 


(n) 


ARB 


8 


6 


0.050 


9 


268 


0.79 


10 


120 


1.0 


11 


16. 


0.90 


Total 


410 




TABLE 9b A*0203 


Peptide length 


(n) 


ARB 




-- ' -6 - 


0.11 


9 


272 


1.0 


10 


122 


0.75 


11 


16 


0.36 


Total 


416 




TABLE 9c A*0206 


Peptide length 


(n) 


ARB 




6 


...^.....................M66 


9 


268 


1.0 


10 


120 


0.38 


11 


16 


0.66 


Total 


410 




TABLE 9d A*6802 


Peptide length 


(n) 


ARB 


8 ■;. 


6 


0.071 




268 


1.0 


10 


120 


0.60 


. . ' , 11 


16 


0.47 


Total 


410 





The influence of secondary anchor residues on the capacity of peptides to bind A*0202, 

A*0203, A*0206, and A*6802 was examined next, The number of peptides available only 
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allowed analysis of 9- and lO-mer ligands. The ARB values for 9-mer and 10-mer peptides as a 
function of the presence of a particular residue in a specific position are shown in Tables 10-13, 
and summary maps in Figures 5-8. As noted above, positive and negative effects are defined as 
associated with three-fold or greater increases or decreases in binding aflinity, respectively. 
5 In Tables 10a and I Ob, a panel of 268 9-mer peptides and a panel of 120 10-mer peptides, 

respectively, were tested for binding to the A*0202 allele. In Tables 11a and 1 lb, a panel of 272 
9-mer peptides and a panel of 122 10-mer peptides, respectively, were tested for binding to the 
A* 0203 allele. In Tables 12a and 12b, a panel of 268 9-mer peptides and a panel of 120 10-mer 
peptides, respectively, were tested for binding to the A*0206 allele. In Tables 13a and 1 3b, a 

10 panel of 268 9-mer peptides and a panel of 120 10-mer peptides, respectively, were tested for 
binding to the A*6802 allele. All peptides were based on naturally occurring sequences from 
various viral, bacterial, or pathogen origin and had at least 1 preferred and 1 tolerated residue at 
the main anchor positions. ARB values are based on the grouping of chemically similar 
residues, generally as described in Ruppert et aL, Cell 74\ 929 (1993), for example. At 

1 S secondary anchor positions values corresponding to a 3 -fold or greater increase in binding 
capacity are indicated by bolded and increased font. Negative effects, associated with a three- 
fold decrease in binding affmity, are indicated by underlined and italicized font. Also, residues 
determined to be preferred or tolerated anchors are indicated by bold font To allow use of the 
values shown in this table as coefficients for predictive algorithms, the values for non-tolerated 

20 anchor residues were set to 0.001, equivalent to a 1000-fold reduction in binding capacity, to 
filter out non-motif peptides. The average geometric binding capacity of each panel in Table 
lOa, 10b, 1 la, 1 lb, 12a, 12b, 13a, and 13b was 401 nM, 342 nM, 85 nM, 95 nM, 387 nM, 643 
nM, 838 nM, and 1055 nM, respectively. 

In general, deleterious effects were frequently (35%) associated with charged residues 

25 (D, E, R, H, or K). An additional 35% of the deleterious influences could be attributed to G or P. 
Positive influences were relatively evenly attributed to basic (R, H, K), acid (D, E), hydrophobic 
(F, W, Y, L, I, V, M) or small (A, P) residues. 

While each molecule had a distinctive pattern of preferences and aversions, some . 
common trends could be noted in the case of 10-mer peptides. For example, for all molecules Q 

30 and N were preferred in position 1 , and R, H, and K were preferred in position 8. D, E, and G 
were uniformly deleterious for 10-mer peptides in position 3, Consensus preferences or 
aversions were not noted for 9-mer peptides. 
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TABLE 10a 
9-mer peptides 



Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


1.1 


0.16 


4.2 


1.5 


0.86 


0.23 


2.4 


1-1 


0.43 


C 


0.30 


O.OOIO 


0.71 


1.2 


2.1 


2.1 


0.95 


0.95 


0.0010 


D 


0.083 


0.0010 


0.097 


1.2 


0.78 


0.71 


0.23 


0.95 


0.0010 


E 


0.083 


0.0010 


0.097 


1.2 


0.78 


0.71 


0.23 


0.95 


0.0010 


F 


2.0 


0.0010 


2.1 


0.59 


1.9 


0.51 


o.n 


3.0 


0.0010 


G 


0.46 


0.0010 


0.66 


1.9 


0.23 


0.36 


0.71 


0.64 


0.0010 


H 


1.6 


0.0010 


0.34 


0.74 


0.58 


0.43 


1.8 


1.1 


0.0010 


I 


1.1 


0.17 


1.1 


1.4 


0.79 


2.2 


0.75 


0.41 


1.0 


K 


1.6 


O.OOIO 


0.34 


0.74 


0.58 


0.43 


1.8 


1.1 


0.0010 


L 


11 


0.081 


1.1 


1.4 


0.79 


2.2 


0.75 


0.41 


0.76 


M 


1.1 


0.14 


1.1 


1.4 


0.79 


2.2 


0.75 


0.41 


047 


N 


0.37 


O.OOIO 


0.35 


0.24 


1.8 


0.87 


1.5 


1.3 


0.0010 


P 


0.42 


0.0010 


2.8 


0.43 


0.55 


0.26 


0.75 


1.9 


0.0010 


Q 


0.37 


1.0 


0.35 


0.24 


1.8 


0.87 


1.5 


1.3 




R 


1.6 


n nniA 


0.34 


0.74 


0.58 


0.43 


1.8 


1.1 


U.UvlU 


S 


0.30 


0.0010 


0.71 


1.2 


2.1 


2.1 


0.95 


0.95 


aoolo 


T 


0.30 


0.18 


0.71 


1.2 


2.1 


2.1 


0.95 


0.95 


0.15 


V 


1.1 


0.29 


l.l 


1.4 


0.79 


2.2 


0.75 


0.41 


0.92 


W 


2.0 


0.0010 


2.1 


0.59 


1.9 


0.51 


0.77 


3.0 


O.OOIO 


Y 


2.0 


0.0010 


2.1 


0.59 


1.9 


0.51 


0.77 


3.0 


O.OOIO 
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TABLE 10b 
lO-mer peptides 

Position (ARB) 



Residue 


1 


2 


3 


4 


s 


6 


7 


8 


9 


10 


A 


1.2 


0.16 


1.1 


0.81 


1.4 


3.1 


0.56 


1.4 


2.4 


0.43 


c . 


0.27 


0.0010 


0.44 


3.0 


1.2 


0.95 


0.43 


1.6 


1.5 


0.0010 


D 


0.16 


0.0010 


0.28 


2.2 


9.1 


3.6 


2.2 


0.0077 


1.8 


0.0010 


£ 


0.16 


0.0010 


0.28 


2.2 


9.1 


3.6 


2.2 


0.0077 


1.8 


0.0010 


F 


3.9 


0.0010 


5.8 


1.3 


0.83 


2.8 


1.3 


1.5 


1.1 


0.0010 


G 


0.32 


0.0010 


0.098 


0.88 


1.0 


0.44 


0.32 


1.0 


0.59 


0.0010 


H 


2.1 


0.0010 


2.0 


0.52 


0.89 


0.21 


0.74 


9.9 


0.22 


0.00 10 


I 


0.76 


0.17 


0.85 


0.65 


0.67 


0.60 


6.7 


0.40 


0.60 


LO 


K 


2.1 


0.0010 


2.0 


0.52 


0.89 


0.21 


0.74 


9.9 


0.22 


0.0010 


L 


0.76 


0.081 


0.85 


0.65 


0.67 


0.60 


6.7 


0.40 


0.60 


0.76 


M 


0.76 


0.14 


0:85 


0.65 


0.67 


0.60 


6.7 


0.40 


0.60 


0.17 


N 


4.2 


0.0010 


0.38 


1.4 


0.66 


0.36 


0.26^ 


0.79 


0.91 


0.0010 


P 


0.46 


0.0010 


1.1 


0.091 


2.3 


2.5 


0.14 


1.2 


3.8 


0.0010 


Q 


4.2 


1.0 


0.38 


1.4 


0.66 


0.36 


0.26 


0.79 


0.91 


0.0010 


R 


2.1 


A AAIA 


2.0 


0.52 


0.89 


0.21 


0.74 


9.9 


0,22 




S 


0.27 


0.0010 


0.44 


3.0 


1.2 


0.95 


0.43 


1.6 


1.5 


0.0010 


T 


0.27 


0.18 


0.44 


3.0 


1.2 


0.95 


0.43 


1.6 


1.5 


0.15 


V 


one 


0.29 


0.85 


0.65 


0.67 


0.60 


6.7 


0.40 


0.60 


0.92 


W 


3.9 


0.0010 


5.8 


1.3 


0.83 


2.8 


1.3 


1.5 


l.l 


0.0010 


Y 


3.9 


0.0010 


5.8 


1.3 


0.83 


2.8 


1.3 


1.5 


l.l 


0.0010 
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TABLE 11a 
9-mer peptides 

Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


0.95 


0.077 


4.4 


2.3 


1.2 


0.36 


4.3 


1.4 


0.17 


C 


0.41 


0.0010 


0.83 


1.4 


0.91 


0.86 


1.8 


1.7 


0.0010 


D 


0.42 


0.0010 


0.059 


0.73 


0.28 


0.36 


0.56 


0.64 


0.0010 


E 


0.42 


0.0010 


0.059 


0.73 


0.28 


0.36 


0.56 


0.64 


0.0010 


F 


3.3 


0.0010 


0.71 


0.55 


1.5 


0.28 


0.075 


1.3 


0.0010 


G 


1.1 


O.OOlO 


1.8 


1.5 


0.86 


1.3 


3.2 


1.2 


0.0010 


H 


0.63 


0.0010 


4.2 


0.91 


1.9 


0.71 


0.95 


0.30 


0.0010 


I 


1.1 


0.070 


0.77 


0.85 


0.63 


1.9 


1.2 


0.56 


0.56 


K 


0.63 


0.0010 


4.2 


0.91 


1.9 


OJl 


0.95 


0.30 


0.0010 


L 


1.1 


1.0 


0.77 


0.85 


0.63 


1.9 


1.2 


0.56 


0.14 


M 


1.1 


0.63 


0.77 


0.85 


0.63 


1.9 


1.2 


0.56 


0.17 


N 


0.36 


0.0010 


1.3 


0.59 


2.1 


1.3 


0.97 


1.3 


0.0010 


P 


0.0J5 


0.0010 


1.0 


0.55 


1.2 


1.8 


1.0 


4.4 


0.0010 


Q 


0.36 


6.51 


1.3 


0.59 


2.1 


1.3 


0.97 


1.3 


"0.6016 


R 


0.63 


0.0010 


4.2 


0.91 


1.9 


0.71 


0.95 


0.30 


0.0010 


S 


0.4f 


0.0010 


0.83 


1.4 


0.91 


0.86 


1.8 


l.f 


o^obrr 


T 


0.41 


0.045 


0.83 


1.4 


0.91 


0.86 


1.8 


1.7 


0.26 


V 


l.l 


0.10 


0.11 


0.85 


0.63 


1.9 


1.2 


0.56 


1.0 


W 


33 


0.0010 


0.71 


0.55 


1.5 


0.28 


0.075 


1.3 


0.0010 


Y 


3J 


O.0O1O 


0.71 


0.55 


1.5 


0.28 


0.075 


1.3 


0.0010 
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TABLE lib 

10-mer peptides 



PCT/US02/02708 



Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


A 


2.1 


0.077 


1.5 


1.1 


3.8 


1.3 


0.56 


1.7 


3.0 


0.17 


C 


0.68 


0.0010 


0.33 


1.0 


0.82 


0.69 


0.69 


2.2 


1.1 


0.0010 


D 


0.32 


0.00)0 


0.074 


3.7 


1.1 


2.4 


0.60 


16 


2.8 


0.0010 


E 


0.32 


0.0010 


0.074 


3.7 


1.1 


2.4 


0.60 


16 


2.8 


0.0010 


F 


8.3 


0.0010 


6.4 


0.66 


1.0 


1.3 


1.7 


0.23 


1.3 


0.0010 


G 


1.0 


0.0010 


0.32 


0.59 


0.63 


1.0 


0.33 


3.8 


2.6 


0.0010 


H 


0.75 


0.0010 


3.9 


1.4 


0.62 


0.55 


0.77 


4.7 


0.085 


0.0010 


I 


0.29 


0.070 


0.83 


0.60 


1.1 


0.57 


3.3 


0.65 


0.52 


0.56 


K 


0.75 


0.00 10 


3.9 


1.4 


0.62 


0.55 


0.77 


4.7 


0.085 


0.0010 


L 


0.29 


1.0 


0.83 


0.60 


1.1 


0.57 


3.3 


0.65 


0.52 


0.14 


M 


0.29 


0.63 


0.83 


0.60 


1.1 


0.57 


33 


0.65 


0.52 


0.17 


N 


6.0 


0.0010 


0.43 


2.8 


0.75 


1.3 


0.17 


0.89 


0.91 


0.0010 


P 


0.019 


0.0010 


0.90 


0.091 


1.1 


4.9 


3.6 


1.4 


2.5 


0.0010 






0J51 


0.43 


'1.8" " 


" 0.75^ 


1.3 


0.17 


0.89 


0.91 


' oBiF 


R 


\J.15 




3.9 


1 A 

1.4 


U.02 


0.55 


A T7 

0.77 


4.7 


/I /IDC 


U.UV'lvl 


S 


0.68 


0.0010 


0.33 


1.0 


0.82 


0.69 


0.69 


2.2 


1.1 


0.0010 


T 


0.68 


0.045 


0.33 


1.0 


0.82 


0.69 


0.69 


2.2 


1.1 


0.26 


V 


0,29 


0.10 


0.83 


0.60 


1.1 


0.57 


3.3 


0.65 


0.52 


1.0 


W 


8J 


0.0010 


6.4 


0.66 


1.0 


1.3 


1.7 


0.23 


1.3 


0.0010 


Y 


8J 


0.0010 


6.4 


0.66 


1.0 


1.3 


1.7 


0.23 


L3 


0.0010 
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TABLE 12a 
9-iner peptides 

Position (ARB) 





1 


2 


3 


4 


5 


6 . 


7 


8 


9 


A 








1 6 


0 74 

V/./ 


0.21 


1.3 


0.53 


0 If) 


c 


0.35 


U.UUiU 


0.47 


1 1 
1.1 


1.4 


0.75 


U./Z 


1.0 


0.0010 


D 


U.ol 




U.ol 


1 A 


z.z 








0.0010 


c 


u.ol 


0.0010 








1 7 




064 


0.0010 


F 


2.5 


0.0010 


1.4 


0.85 


1.9 


1.6 


2.0 


3.3 


0.0010 


G 


0.67 


0.0010 


0.33 


2.4 


0.24 


0.34 


0.81 


0.82 


0.0010 


H 


1.7 


0.0010 


0.13 


0.47 


0.62 


0.61 


0.85 


0.83 


0.0010 


I 


0.77 


0.49 


4.1 


0.82 


0.86 


2.4 


0.74 


0.46 


0.54 


K 


1.7 


0.0010 


0.13 


0.47 


0.62 


0.61 


0.85 


0.83 


0.0010 


L 


0.77 


0.061 


4.1 


0.82 


0.86 


2.4 


0.74 


0.46 


0.23 


M 


0.77 


0.18 


4.1 


0.82 


0.86 


2.4 


0.74 


0.46 


0.071 




0.48 


0:0010 


0.39 


Q;29 


2.0 


0.94 


'••1:3'"" 


1.0 


0.0010- 


P 


O.U 


0.0010 


0.47 


0.32 


0.27 


0.29 


2.1 


1.4 


0.0010 




0.48 


1.0 


0.39 


o:29 


2.0 


0.94 


"'"'t:3":^- 


1.0 


0.0010 


R 


1,7 






U.4/ 


n /CO 


U.bl 


U.oD 


U.oo 


0.0010 


S 


0.35 


0.0010 


0.47 


1.1 


1.4 


0.75 


0.72 


1.6 


0.0010 


T 


0.35 


0.47 


0.47 


1.1 


1.4 


0.75 


0.72 


1.6 


0.11 


V 


0.77 


0.53 


4.1 


0.82 


0.86 


2.4 


0.74 


0.46 


1.0 


W 


2.5 


0.0010 


1.4 


0.85 


1.9 


1.6 


2.0 


3.3 


0.0010 


Y 


2.5 


0.0010 


1.4 


0.85 


1.9 


1.6 


2.0 


3.3 


0.0010 
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TABLE 12b 
10-mer peptides 



PCTAJS02/02708 



Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


* 

A 


2.4 


0.52 


0.62 


1./ 






U.l/ 






U.lo 


C 


0.61 


0.0010 


0.23 


0.71 


1.4 


0.80 


0.56 


1.2 


0.78 


0.0010 


D 


0.068 


0.0010 


0,099 


2.7 


11 


3.2 


1.2 


0.38 


4.0 


U.UUIU 


E 


0.068 


U.UUIU 


0.099 


2.7 


t *t 
11 


3,2 


1.2 


0,3o 






r 


1 n 
J.U 


0.0010 


A 1 


U.oU 






i.o 






0.0010 


G 


0,71 


0.0010 


0.072 


0.81 


0.61 


0.48 


0.71 


0.73 


0.41 


0 0010 


H 


1.4 


0.0010 


0.17 


0.56 


0.66 


0.86 


0.96 


5.0 


0.25 


0.0010 


I 


0.42 


0.49 


3.8 


0.67 


0.76 


0.90 


4.9 


0.79 


1.0 


0.54 


K 


1,4 


0.0010 


0.17 


0.56 


0.66 


0.86 


0,96 


5.0 


0.25 


0.0010 


L 


0.42 


0.061 


3.8 


0.67 


0.76 


0.90 


4.9 


0.79 


1.0 


0.23 


M 








V.v/ 


0 76 


0 90 


49 


0 79 


1 0 


n 071 

U.U/ X 


Kt 


O.l 


0.0010 




1 ft 

i.o 


U.f«/ 




Old 


0 70 




"'0.0010 


P 


0,17 


0.0010 


0.84 


1.2 


0.57 


0.83 


0.26 


1.3 


3.6 


0.0010 




6.1 


i.6 


0.28 


1.8 


0.47 


6.82 


0.14 


0.20 


0^34 


6.0010 


R 


1.4 


0.0010 


0.17 


0.56 


0.66 


0.86 


0.96 


5.0 


0.25 


0.0010 


S 


0,61 


0.0010 


0.23 


0.71 


1.4 


0.80 


0.56 


1.2 


0.78 


0.0010 


T 


0.61 


0.47 


0.23 


0.71 


1.4 


0.80 


0.56 


1.2 


0.78 


0.11 


V 


0.42 


0.53 


3.8 


0.67 


0.76 


0.90 


4.9 


0.79 


1.0 


1.0 


W 


3,0 


0.0010 


4.1 


0.80 


1.2 


2.6 


1.8 


2.1 


0.45 


0.0010 


Y 


3.0 


0.0010 


4.1 


0.80 


1.2 


2.6 


1.8 


2.1 


0,45 


0.0010 
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TABLE 13a 
9-nier peptides 

Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


A 


0.36 


0.13 


6.8 


0.98 


0.71 


0.14 


3.4 


0.71 


0.15 


C 


1.0 


0.0010 


0.42 


0.92 


0.95 


1.7 


0.60 


0.75 


0.0010 


D 


352 


0.0010 


0.30 


0.70 


0.28 


0.70 


0.36 


0.45 


0.0010 


e; ■ . 


352 


0.0010 


0.30 


0.70 


0.28 


0.70 


0.36 


0.45 


0.0010 


' F ' 


7.6 


0.0010 


2.7 


1.4 


1.8 


2.3 


1.5 


2.1 


0.0010 


G 


0.054 


0.0010 


0.24 


2.5 


0.48 


0.53 


0.85 


1.9 


0:0010 


,. H 


0.16 


.0.00I0,_ 


^0,27.„- 


^0.55-' 


0.68 


3.2 


3.2 


. 1.5 


Q.OOIQ 


I 


2.2 


0.052 


0.88 


1.3 


1.1 


0.80 


0.65 


0.57 


0.80 


K 


0.16 


0.0010 


6.27 


0.55 


0.68 


3.2 


3.2 


1.5 


0.0010 


L 


2.2 


0.0078 


0.88 


1.3 


1.1 


0.80 


0.65 


0.57 


0.32 


M 


2.2 


0.023 


0.88 


1.3 


1.1 


0.80 


0.65 


0.57 


0.093 


. N 


0.83 


0.0010 


1.6 


0.45 


0.36 


0.71 


0.46 


1.8 


0.0010 


P 


0.49 


0.0010 


2.8 


0.43 


24 


2.3 


0.71 


1.7 


0.0010 


Q 


0.83 


0.0010 


1-6: 


0.45 


0.36 


0.71 


0.46 


1.8 


0.0010 


R 


0.16 


0.0010 


0.27 


0.55 


0.68 


3J 


3.2 


1.5 


n An 1 A 


S 


1.0 


o.dbio 


0.42 


0.92 


6^95 




tM 






T 


1.0 


0.45 


0.42 


0.92 


0.95 


1.7 


0.60 


0.75 


0.062 


V 


2.2 


1.0 


0.88 


1.3 


l.l 


0.80 


0.65 


0.57 


1.0 


w 


7.6 


0.0010 


2.7 


1.4 


1.8 


2.3 


1.5 


2.1 


0.0010 


Y 


7.6 


0.0010 


2.7 


1.4 


1.8 


2.3 


1.5 


2.1 


0.0010 



62 



wo 02/061435 PCTAJS02/02708 
TABLE 13b 
1 0-mer peptides 



Position (ARB) 



Residue 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


A 


0.50 


0.13 


5.6 


3.5 


2.7 


0.69 


0.71 


1.3 


1.4 


0.15 


C 


2.1 


0.0010 


1.4 


1.4 


0.20 


0.72 


0.26 


1.1 


0.55 


0.0010 


D 


3.2 


0.0010 


0.042 


4.8 


4.3 


0.68 


0.28 


0.10 


1.2 


0.0010 


E 


3.2 


O.OOlO 


0.042 


4.8 


4.3 


0.68 


0.28 


0.10 


1.2 


0.0010 


F 


1.1 


0.0010 


2.7 


1.4 


1.3 


1.5 


4.9 


0.98 


2.2 


O.OOlO 


G 


0.086 


0.0010 


0.16 


0.38 


2.1 


0.54 


1.5 


1.5 


0.66 


O.OOlO 


H 


0.73 


0.0010 


0.16 


0.15 


0.70 


0.18 


3.8 


3.1 


0.88 


0.0010 


I 


1.2 


0.052 


1.2 


1.2 


2.8 


1.8 


1.7 


0.96 


0.74 


0.80 


K 


0.73 


0.0010 


0.16 


0.15 


0.70 


0.18 


3.8 


3.1 


0.88 


0.0010 


L 


1.2 


0,0078 


1.2 


1.2 


2.8 


1.8 


1.7 


0.96 


0.74 


0.32 


M 


1.2 


0.023 


1.2 


1.2 


2.8 


1.8 


1.7 


0.96 


0.74 


0.093 


N 


16 


0.0010 


0.22 


1.5 


0.20 


8.4 


3.2 


0.31 


1.6 


0.0010 


P 


115 


0.0010 


0.17 


0.045 


0.090 


0.60 


0.12 


0.96 


1.8 


0.0010 


Q 


16 


0.0010 


0.22 


1.5 


0.20 


8.4 


3.2 


0.31 


1.6 


0.0010 


R 


0.73 


0.0010 


0.16 


0.15 


0.70 


0.18 


3.8 


3.1 


0.88 


U.UUlU 


S 


2.1 


0.0010 


1.4 


1.4 


6.20 


0.72 


0.26 


1.1 


0.55 


dooib 


T 


2.1 


0.45 


1.4 


1.4 


0.20 


0.72 


0.26 


1.1 


0.55 


0.062 


V 


1.2 


1.0 


1.2 


1.2 


2.8 


1.8 


1.7 


0.96 


0.74 


1.0 


W 


1.1 


0.0010 


2.7 


1.4 


1.3 


1.5 


4.9 


0.98 


2.2 


oiooio 


Y 


1.1 


0.0010 


2.7 


1.4 


1.3 


1.5 


4.9 


0.98 


2.2 


0.0010 



In summary, the data in this section describe detailed motifs for 9- and lO-mer peptides 
binding to A*0202, A*0203, A*0206, and A*6802. Each motif is characterized by specific 
features associated with good, or poor, binding peptides. 

A Consensus A2*Supemiotif 

The motifs described above for A2 supertype molecules are very similar and largely 
overlapping. In this respect, a consensus motif can be identified that incorporates features 
commonly shared by the molecule-specific motifs (Figure 9). The consensus motif specifies the 
presence of hydrophobic and aliphatic residues in position 2 of peptide Ugands. At this position, 

63 



wo 02/061435 PCTAJS02/02708 
V, L and M are preferred, while T, Q, A, and I are all tolerated. On the basis of the preference 
rank of each residue in the context of each A2-supertype molecule, V is the most preferred 
residue. At the C-terminus the consensus motif specifies the presence of hydrophobic and 
aliphatic residues L, I, V, M, A, and T. V is most frequently the optimal residue, while L and I 
5 are also considered preferred, typically being the next most optimal residues. M, A, and T are 
considered as tolerated residues. 

The secondary anchor maps for A*0201 , A*0202, A*0203, A*0206, and A*6802 were 
utiUzed to derive a supertype consensus secondary anchor motif for 9- and IG-mer peptides 
(Figure 9). Residues considered as preferred for 3 or more A2-supertype molecules, without 
10 being deleterious for any molecule, were considered as preferred for the supertype consensus 

- - motif Conversely, residues identified as deleterious for 3 or more molecules were designated as 

deleterious in the consensus motif. The consensus motif overlaps significantly with the detailed 
A*0201 motif, and includes a preference for aromatic residues in position 1 and/or 3, and a 
shared aversion for charged residues in position 3. 

15 Correlation Between A*0201 Binding Affinity and A2-Supertvpe Cross-Reactivity 

Because of the dominance in four major ethnicities of A*0201 compared with other A2 
supertype alleles (see, e.g., table 3), it was of interest to determine how well A*62bl binders 
also bound to other A2-supertype molecules. It was found that peptides that bound A*0201 with 
good affinity (fcso <5^^ 

20 Between 36. 1 and 73.6% of A*0201 binding peptides bound other A2-supertype molecules. 
Analysis of A2-supertype degeneracy as a function of A*0201 affinity also yielded interesting 
results. 72.8% of the peptides that bound A*0201 with IC50 <500 nM bound 3 or more A2- 
supertype molecules (Table 14b). As a general rule, the higher the binding dfinity of a peptide 
for A*0201, the higher the likelihood that the peptide would also bind 3 or more supertype 

25 molecules. Over 96% of the peptides that bound A*0201 with affinities of 20 nM or better also 
bound 3 or more A2-supertype molecules. By contrast, A2-supermotif peptides that did not bind 
A*0201 with affinities better than 500 nM only rarely (10%) bound 3 or more A2 supermotif 
molecules, and never bound 4 or more molecules. 

In summary, this analysis of the cross-reactive binding of peptides to A*0201 and other 

30 A2-supertype molecules confirms the fact that this family of HLA molecules recognizes similar 
structural features in their peptide ligands. It has also been shown that A*0201 binding affinity 
correlates with the propensity to bind muhiple A2-supertype alleles. 
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TABLE 14a 



Crossreactivity between A2-supertype molecules 







% binders crossreacting with: 






Allele 


A*0201 


A*0202 


A*0203 


A*0206 


A*6802 


Average 


A*0201 




54.9 


73.6 


50.2 


36.1 


53.7 


A*0202 


54.9 




50.2 


38.7 


26.2 


42.5 


A*0203 


73.6 


50.2 




42.7 


30.0 


49.1 


A*0206 


50.2 


38.7 


42.7 




24.3 


39.0 


A*6802 


36.1 


26.2 


30.0 


24.3 




29.2 


TABLE 14b 














Degeneracy of A*0201 binders 










A*0201 




A2-supertype alleles bound (% of peptides) 




affinity 


0 


1 


2 3 


4 


5 


>=3 


<=20 


0.0 


0.0 


3.5 17.5 


36.8 


42.1 


96.5 


<=100 


0.0 


3.6 


11.2 21.4 


34.7 


29.1 


85.2 


<=500 


0.0 


7.1 


20.1 25.1 


28.3 


19.3 


72.8 


>500 


40.0 


33.3 


16.7 10.0 


0.0 


0.0 


lO.O 



Analysis 



The results of this analysis allow for the detailed definition of the properties of peptides 
that bind to HLA-A*0201 and other A2-supertype molecules. The A2-supertype molecules 
share not only largely overlapping peptide binding specificity, but also significantly overlapping 
5 peptide binding repertoires. Specific features of peptide ligands associated with degenerate A2- 
supertype binding capacity were identified which provide a logical explanation for the supertype 
relationship. 

In a previous study the peptide binding specificity of A*020l was analyzed, and a 
detailed motif, including the identification of secondary anchor features, was constructed. In the 
10 present analyses, performed with a 10-fold larger database, we confirmed that data and extended 
the analysis to include 8- and 1 l-mer peptides. Overall, the specificity of A*020l for 8- and 1 1- 
mer peptides was largely similar to that for 9- and 10-mer peptides. For example, regardless of • 
peptide size, the majority of negative influences on binding cajpacity were associated with the 
presence of charged residues in secondary anchor positions, while the majority of positive 
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influences were associated with the presence of hydrophobic residues. The definition of detailed 
motifs for 8- and 1 1-mer peptides should allow for a more complete identification of epitopes. 
Identification of A*0201 binders has been greatly facilitated by the use of the algorithms based 
on ARB values. In the present analyses a substantially larger database was used than previously 
5 available, allowing for a refinement of algorithm coefficients. Because the newer coefficients 
are based on a significantly larger data set, they are statistically more accurate and should afford 
more efficient and precise prediction of epitopes. Indeed, recent analysis has shown that a 
revised A*0201 9-mer polynomial algorithm based on a larger data set is more accurate than 
both an older algorithm based on a small data set, and neural network prediction methodologies. 

10 In addition to increasing the accuracy of epitope prediction (Ruppert, J., et al, supra; Sidney, J., 
et aLy supra; Kondo, A., et al, supra; Gulukota, K., et al., supra; Parker, K.C., e/ aLy "Sequence- 
Motifs Important for Peptide Binding to the Human MHC Class I Molecule, HLA-A2," J. 
Immunol 149:3580-3587 (1992) and Milik, M., et al, "Applicatioti of an Artificial Neural 
Network to Predict Specific Class I MHC Binding Pq)tide Sequences," Nature (Biotech) 16:753- 

15 756 (1998)), detailed peptide binding motifs defining both primary and secondary anchor 
positions allow for the rational design of optimized ligands. For example, natural sequences 
carrying sub-optimal residues at primary and/or secondary positions can be identified. The sub- 
optimal residues may be replaced with optimal anchors, generating epitopes with increased 
binding affinity (Sidney, J., et al, supra; Pogue, R.R., et al,, "Amino-Terminal Alteration of die 

20 HLA-A*0201 -Restricted Human Immunodeficiency Virus Pol Peptide Increases Complex 

Stability and in Vitro Immunogenicity," Proc, Natl Acad. Sci., USA, 92:8166-8170 (1995) and 
Bakker, A.B., et al^ "Analogues of CTL epitopes With Improved MHC Class-I Binding 
Capacity Elicit Anti -Melanoma CTL Recognizing the Wide-Type Epitope," Int, J. Cancer^ 
70:302-309 (1997)). Following this type of modification, wild type peptides that were unable to 

25 elicit responses, or were poor immunogens, may become highly immunogenic Pogue, R.R., et 
al, supra; Bakker, A.B., et al, supra; Parkhurst, M.R., "Improved Induction of Melanoma- 
Reactive CTL With Peptides From the Melanoma Antigen gplOO Modified at HLA-A*020l- 
Binding Peptides," J. Immunol 157:2539-2548 (1996); Rosenberg, S.A., et al, "Immunologic 
and Therapeutic Evaluation of a Synthetic Peptide Vaccine for the Treatment of Patients With 

30 Metastatic Melanoma," Nature (Med) 4:321-327 (1998); Sarobe, P., et al, "Enhanced in vitro 
Potency and in vivo Immunogenicity of a CTL Epitope From Hepatitis C Virus Core Protein 
Following Amino Acid Replacement at Secondary HLA-A2,1 binding positions," 7. Clin, Invest, 
102:1239-1248 (1998) and Ahlers, J.D., et al, "Enhanced Immunogenicity of HlV-1 Vaccine 
Construct by Modification of the Native Peptide Sequence," Proc. Nat 7 Acad ScL, USA, 
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94: 1 0856- 1 086 1 ( 1 997)). The CTL induced by such analog peptides have been shown to be 
capable, in most instances, of recognizing target cells expressing wild type antigen sequences. 
This phenomenon is likely to reflect less stringent epitope binding requirements for target cell 
recognition compared to that needed for stimulation of nai ve T-cells to induce differentiation 
5 into effectors (Cho, B.K., et al., "Functional Differences Between Memory and Naive CDS T 
Cells " Proc. Nat 1 Acad ScL USA 96:2976-2981 (1999); Sykulev, Y., et ai, "Evidence That A 
Single Peptide - MHC Complex On A Target Cell Can Elicit Acytolytic T Cell Response," 
Immunity 4:565-571 (1996)). Thus, the detailed motifs described herein will facilitate not only 
in the identification of naturally occurring CTL epitopes, but also in the design of engineered 

10 epitopes with increased binding capacity and/or inmiunogenic characteristics. 

The peptide binding specificity for other A2-supertype molecules was also investigated 
using single substitution analog peptides and peptide libraries. In agreement with previous 
reports (del Guercio, M-F, et aL, "Binding of a Peptide Antigen to Multiple HLA Alleles Allows 
Definition of an A2-Like Supertype," / Immunol 154:685-693 (1995) and (Sidney, J., et aL, 

15 "Practical, Biochemical and Evolutionary Implications of the Discovery of HLA Class I 
Supermotifs,"//n/?iMTO/ Today 17:261-266 (1996)); see also reports filed forNIH-NLMD 
contract NOl-AI-45241), we found that the primary anchor motifs of A2-supertype molecules 
were remarkably similar. The use of peptide libraries allowed detailed characterization of the 
secondary anchor preferences and aversions of each molecule. It was shown that, while each 

20 A2-supertype molecule had a unique specificity, a supermotif based on consensus patterns could 
be identified. Because the supermotif describes features of peptide ligands that are shared 
amongst A2-supertype molecules, it is expected to allow the efficient identification of highly 
cross-reactive peptides, and indicate appropriate strategies for anchor fixing, allowing 
modulation of the supertype degeneracy of peptide ligands. A further result of the present 

25 analysis was the derivation of coefficients that could be utilized in algorithms for predicting 
peptide binding to A*0202, A*0203, A*0206, and A*6802. 

As HLA A*0201 is by far the most prevalent A2-supertype allele, both in the general 
population and within major ethnic groups, the peptide screening strategy that was utilized 
focused first on the identification of A*0201 binders. It was determined that over 70% of the 

30 peptides that bind to A*0201 also bind to at least 2 additional A2-supertype molecules, and that 
the propensity to bind other A2-supertype alleles correlated with A*0201 binding affinity. 

In conclusion, the data described herein provide formal demonstration of the shared 
peptide binding specificity of a group of HLA-A molecules designated as the A2-supertype. Not 
only do these molecules recognize similar features at primary and secondary anchor positions of 
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their peptide ligands, they also share largely overlapping peptide binding repertoires. The 
demonstotion that these molecules share largely overlapping repertoires has a significant 
implication for the design of potential vaccine constructs, hideed, the concept that A2-supertype 
cross-reactivity at the peptide binding level may be of immunological relevance has been 
5 demonstrated in a number of studies, in both infectious disease (Khanna R., et aL, "Identification 
of Cytotoxic T-Cell Epitopes Within Epstein-Barr Virus (EBV) Oncogene Latent Membrane 
Protein 1 (LMPl): Evidence for HLA A2 Supertype-Restricted Immune Recognition of EBV- 
Infected Cells by LMPl-Specific Cytotoxic T lymphocytes/'^wr J Immunol, 28:45 1-458 (1998); 
Bertoletti, A., et aly "Molecular Features of the Hepatitis B Virus Nucleocapsid T-Cell Epitope 
10 18-27: Interaction With HIA An T-Cell Receptor;vi/e;7aro/o^ 

Responses in Humans Expressing Different HLA-A2 Supertype Molecules," Hum Immunol 
60:1013-1017, (1999); Bertoni, R., et aU "Human Histocompatibility Leukocyte Antigen- 
Binding Supermotifs Predict Broadly Cross-Reactive Cytotoxic T Lymphocyte Responses m 

15 Patients With Acute Hepatitis "J C/w Inv^t 100:503-513 (1997); and Doolan, D.L., et a/., 

"Degenerate Cytotoxic T-Cell Epitopes from P. falciparum Restricted by Multiple HLA-A and 
HLA-B Supertype Alleles," //nwM/iify 7:97-1 12 (1997)) and cancer (Fleischhauer, K., et al, 
"Multiple HLA-A Alleles Can Present an Immunodominant Peptide of the Human Melanoma 
Antigen Melan-A/MART-1 To A Peptide-Specific HLA-A'^0201+ Cytotoxic Cell Line," J 

20 Immunol, 157: 787-797 (1996); Rivbltini, L., etaL, 'binding and Presentation of Peptides 
Derived From Melanoma Antigens MART- 1 and Glycoprotein- 1 00 by HLA-A2 Subtypes: 
Implications for Peptide-Based Immunotherapy," J /m/7iw/Jo/ 156:3882-3891 (1996); 
Kawashima, L, "The Multi-Epitope Approach for Immunotherapy for Cancer: Identification of 
Several CTL Epitopes from Various Tumor- Associated Antigens Expressed on Sohd Epithelial 

25 Tumors," i/M/w/iPMmM/io/ 59:1-14 (1998)) settings. 

Example 6: Peptide Composition for Prophvlactic Uses 

Vaccine compositions of the present invention are used to prevent infection or treat 
cancer in persons. For example, a polyepitopic peptide epitope composition containing multiple 
CTL and HTL epitopes is administered to individuals at risk for HCV infection. The 
30 composition is provided as a single lipidated polypeptide that encompasses multiple epitopes. 
The vaccine is adnainistered in an aqueous canier comprised of Freund'^^ 
The dose of peptide for the iiiitial immunization is from about 1 to about 50,000 jig for a 70 kg 
patient administered in a human dose volume. The initial administration of vaccine is followed 
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by booster dosages at 4 weeks followed by evaluation of the magnitude of the immune response 
in the patient, by techniques that determine the presence of epitope-specific CTL populations in a 
PBMC sample. Additional booster doses are administered as required. The composition is 
found to be both safe and efficacious as a prophylaxis against HCV infection. 
5 Alternatively, the polyepitopic peptide composition can be administered as a nucleic acid 

in accordance with methodologies known in the art and disclosed herein. 

The above discussion is provided to illustrate the invention but not to limit its scope. 
Other variants of the invention will be readily apparent to one of ordinary skill in the art and are 
encompassed by the appended claims. All publications, patents, and patent applications cited 
1 0 herein are hereby incorporated by reference. 
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CLAIMS 

What is claimed is: 

1. A method for identifying a HLA-A2 supermotif-restricted peptide, comprising: 
contacting a peptide consisting of 8-1 1 amino acids, wherein the amino acid at position 

5 two from the N-terminus of the peptide is L, I, V, M, A, T, or Q and the C-terminal amino acid is 
L, I, V, M, A, or T, with three or more of the HLA molecules encoded by A*0201 , A*0202, 
A*0203, A*0204, A*0205, A*0206, A*0207, A*68G2, and A*6901 alleles; 
measuring IC50 values; and 

identifying a peptide that binds at least three HLA molecules with an IC50 value less than 
10 500 nM as a HLA-A2 supermotif restricted peptide. 

2. The method of claim 1 , wherein the amino acid at position two of the peptide is 
V,A,T,orQ, 

15 3 . The method of claim 1 , wherein the amino acid at position two of the peptide is L, 

I,M,orQ. 

4. The method of claim 1, wherein the amino acid at position two of the peptide is I 

or Q. 

20 

5. The method of claim 57, wherein the C-terminal amino acid is L, I, V, M, A, or T. 

6. The method of claim 1 , wherein the C-terminal amino acid is T. 

25 7. The method of claim 1 , wherein the peptide is derived from an HIV antigen, HBV 

antigen, HCV antigen, HPV antigen, PSA antigen, Epstein-Barr virus antigen, KSHV antigen, 
Lassa virus antigen, MT antigen, p53 antigen, CEA antigen, TSA antigen, MAGE antigen, or 
Her2/neu antigen. 

30 8. A method for identifying an inununogenic HLA-A2 supermotif-restricted peptide, 

comprising: 
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contacting a peptide consisting of 8-1 1 amino acids, wherein the amino acid at position 
two from the N-terminus of the peptide is L, I, V, M, A, T, or Q and the C-terminal amino acid is 
L, I, V, M, A, or T to fomi peptide/HLA-A2 complexes, with three or more of the HLA 
molecules encoded by A*0201, A*0202, A*0203, A*0204, A*0205, A*0206, A*0207, A*6802, 
5 andA*6901 alleles; 

determining whether the peptide/HLA-A2 complexes induce a CTL response, and 
identifying a peptide that induces a CTL response in complex with at least three of the 
HLAs as a HLA-A2 supermotif restricted peptide. 

10 9. The method of claim 8, wherein the amino acid at position two of the peptide is 

V,A,T,orQ. 

10. The method of claim 8, wherein the amino acid at position two of the peptide is L, 
I,M,orQ. 



15 



20 



1 1. The method of claim 8, wherein the amino acid at position two of the peptide is I 

orQ. 

12. The method of claim 8, wherein the C-terminal amino acid is L, I, V, M, A, or T. 

13. The method of claim 8, wherein the C-terminal amino acid is T. 



14. The method of claim 8, wherein the peptide is derived from an HIV antigen, HBV 
antigen, HCV antigen, HPV antigen, PSA antigen, Epstein-Barr virus antigen, KSHV antigen, 

25 Lassa virus antigen, MT antigen, p53 antigen, CEA antigen, TSA antigen, MAGE antigen, or 
Her2/neu antigen. 

15. A method for making a HLA-A2 supermotif-restricted peptide, comprising: 
providing an amino acid sequence of an antigen of interest; 

30 identifying within the sequence a putative T-cell epitope, wherein the putative epitope 

consists of 8-1 1 amino acids, wherein the amino acid at position two from the N~terminus of the 
epitope is L, I, V, M, A, T, or Q and the C-terminal amino acid is L, I, V, M, A, or T, 

preparing one or more peptide fragments of the antigen of interest that comprise the 
epitope; 
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contacting the peptide with three or more of the HLA molecules encoded by A*0201, 
A*0202, A*0203, A*0204, A*0205, A*0206, A*0207, A*6802, and A*6901 alleles; 
measuring IC50 values; and 

selecting a peptide that binds at least three HLA molecules with an IC50 value less than 
5 500 nM as a HLA- A2 supermotif-restricted peptide. 

1 6. The method of claim 1 5, wherein the amino acid at position two of the peptide is 
V,A,T,orQ. 

10 17, The method of claim 1 5, wherein the amino acid at position two of the peptide is 

L, I, M, or Q. 

1 8. The method of claim 1 5, wherein the amino acid at position two of the peptide is I 

orQ. 

15 

19. The method of claim 15, wherein the C-tenninal amino acid is L, I, V, M, A, or T, 

20. The method of claim 1 5, whiereiri the C-lirminal ainirid acid is T. 

20 21. The melhod-ofclaim 15/wherd^^ 

Epstein-Barr virus, KSHV, Lassa virus, MT, p53, CEA, TSA, MAGE, or Her2/neu. 

22. A method for making an immunogenic HLA-A2 supermotif-restricted peptide, 
comprising: 

25 providing an amino acid sequence of an antigen of interest; 

identifying within the sequence a putative T-cell epitope, wherein the putative epitope 
consists of 8-1 1 amino acids, wherein the amino acid at position two from the N-tenninus of the 
epitope is L, I, V, M, A, T, or Q and the C-terminal amino acid is L, I, V, M, A, or T, 

preparing one or more peptide fragments of the antigen of interest that comprise the 
30 epitope; 

determining whether the peptide/HLA-A2 complexes induce a CTL response, and 
selecting a peptide that induces a CTL response in complex with at least three of the 
HLAs as a HLA-A2 supermotif restricted peptide. 
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23. The method of claim 22, wherein the amino acid at position two of the peptide is 

V,A,T,orQ. 

24. The method of claim 22, wherein the amino acid at position two of the peptide is 
5 L,I,M,orQ. 

25. The method of claim 22, wherein the amino acid at position two of the peptide is I 

orQ. 

10 26. The method of claim 22, wherein the C-terminal amino acid is L, I, V, M, A, or T. 

27. The method of claim 22, wherein the C-terminal amino acid is T. 

28. The method of claim 22, wherein the antigen is HIV, HBV, HCV, HPV, PSA, 
1 5 Epstein-Barr virus, KSHV, Lassa virus, MT, p53, CEA, TS A, MAGE, or Her2/neu. 
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